Novel macromolecule transduction domains and methods for identification and uses thereof

ABSTRACT

The present invention discloses novel macromolecule transduction domain (MTD) peptides which facilitate the traverse of a biologically active molecule across the cell membrane. Also disclosed are polynucleotides encoding the MTD peptides, methods of identifying the MTD peptides; methods of genetically engineering a biologically active molecule to have cell permeability by using the MTD peptides, methods of importing a biologically active molecule into a cell by using the MTD peptides, and uses thereof.

TECHNICAL FIELD

The present invention relates to novel macromolecule transduction domain (MTD) peptides which facilitate the traverse of a biologically active molecule across the cell membrane, polynucleotides encoding the same, methods of identifying the same, methods of genetically engineering a biologically active molecule having cell permeability by using the same, methods of importing a biologically active molecule into a cell by using the same, and uses thereof.

BACKGROUND ART

Cellular internalization of macromolecules, such as DNA, RNA, proteins, oligonucleotides, and peptides, is still a challenging task because of the presence of the plasma membrane, which constitutes an impermeable barrier for such molecules. Numerous difficulties have been encountered in delivering such molecules to a desired target, including poor penetration into a tissue or cell, toxicity when delivered systemically due to the insufficient specificity of targeting to a particular tissue or cell, side effects when delivered in a high concentration in order to achieve an adequate local concentration at a particular target cell or tissue, and degradation such that inadequate amounts are delivered to the target and/or such that byproducts of degradation result in undesirable side effects.

In order to circumvent these problems, several carrier-mediated delivery systems have been developed. Among them, much attention has recently been given to the use of peptide-based delivery systems. The use of peptides with cell permeability has several advantages, which are mainly due to the various modifications that can be made to the peptide sequence. This allows the engineering of carriers that can address different cellular subdomains and/or are able to transport various types of cargo molecules.

Many cell permeable peptides are designed from sequences of membrane-interacting proteins, such as recombinant proteins, signal peptides, transmembrane domains, and antimicrobial peptides. Within these sequences, short sequences called protein transduction domains (PTDs) have been proved to efficiently cross biological membranes without the need of a carrier or a receptor and to deliver peptides or proteins into intracellular compartments. A number of studies have suggested that the use of PTD-based peptides could be of major importance for therapies against viral diseases or cancers. Among the PTD-based peptides, the third helix of the homeodomain of antennapedia called penetratin (Joliot, A. and A. Prochiantz, Nat. Cell Biol. 6(3):189-96 (2004)), the Tat peptide derived from the transactivating protein Tat of HIV-1 (Wadia, J. S. and S. F. Dowdy, Curr. Opin. Biotechnol. 13(1):52-6 (2002)), transportan (Pooga et al., Faseb J. 12(1):67-77 (1998)), and VP22 (Elliott, G and P. O'Hare, Cell 88(2):223-33 (1997)) have been shown to improve the cellular uptake of peptides, proteins, and oligonucleotides.

A second category of cell-penetrating peptides, called amphipathic peptides, has also been described. An amphipathic molecule can be defined, in short, as consisting of two domains: a hydrophilic (polar) domain and a hydrophobic (non-polar) domain. For peptides, the amphipathic character can arise from either the primary structure or the secondary structure. Primary amphipathic peptides can be defined as the sequential assembly of a domain of hydrophobic residues with a domain of hydrophilic residues. Secondary amphipathic peptides are generated by the conformational state which allows the positioning of the hydrophobic and hydrophilic residues on opposite sides of the molecule.

Other peptides, such as polyarginine-based peptides, calcitonin-derived peptides, and oligomers, have also been proposed as tools for intracellular delivery of therapeutics.

However, the currently known delivery systems appear to be limited due to their lack of efficiency and/or their toxicity, and little is known about the pathway of their cellular uptake, constituting a handicap for improving their efficiency. In addition, a number of delivery systems are limited in their ability to cross cellular and nuclear membranes. Even where such delivery peptides do cross cell membranes, they are often limited in their efficacies due to their entrapment in endosomes.

The present invention is directed to overcoming these deficiencies in the art.

DISCLOSURE Technical Solution

The present invention relates to isolated macromolecule transduction domain (MTD) peptides capable of mediating the transport of a biologically active molecule into a cell.

Another aspect of the present invention relates to isolated polynucleotides encoding such MTD peptides.

The present invention also relates to a method of identifying a MTD peptide having cell permeability.

Another aspect of the present invention relates to a method of genetically engineering a biologically active molecule having cell permeability by attaching a MTD peptide to a biologically active molecule.

Still another aspect of the present invention relates to an isolated recombinant protein having cell permeability comprising a MTD peptide having cell permeability and a biologically active molecule.

The present invention also, relates to a pharmaceutical composition for delivery of a biologically active molecule to a cell comprising a cell permeable recombinant protein where a MTD is attached to a biologically active molecule.

Another aspect of the present invention relates to a method of transporting a biologically active molecule into a cell in a subject comprising administering to a subject a cell permeable recombinant protein comprising a MTD peptide attached to a biologically active molecule.

Other aspects of the present invention relate to the use of MTD peptides for drug delivery, vaccine administration, protein therapy, and gene therapy.

ADVANTAGEOUS EFFECTS

The present invention provides MTD peptides having cell permeability which can facilitate the transport of a biologically active molecule into a cell and, thus, can be effectively used for drug delivery, vaccine administration, peptide therapy, and gene therapy.

DESCRIPTION OF DRAWINGS

FIGS. 1 a to 1 o show tables illustrating the amino acid sequences and structural characteristics of macromolecule transduction domain (MTD) peptides identified according to the method of the present invention.

FIG. 2 is a schematic diagram illustrating the His-MTD-EGFP recombinant protein having cell permeability and constructed according to the present invention.

FIGS. 3 a to 3 c are photographs of an agarose gel electrophoresis analysis showing DNA fragments encoding MTDs-EGFP amplified by PCR according to the present invention.

FIG. 4 a is a schematic diagram illustrating the subcloning of a DNA fragment encoding MTD-EGFP into the pGEM-Teasy vector according to the present invention, whereas FIGS. 4 b to 4 d are photographs of an agarose gel electrophoresis analysis showing DNA fragments encoding MTDs-EGFP subcloned into pGEM-Teasy vector according to the present invention.

FIG. 5 a is a schematic diagram illustrating the cloning of a DNA fragment encoding MTD-EGFP into the pET 28(+) vector according to the present invention, whereas FIGS. 5 b to 5 c are photographs of an agarose gel electrophoresis analysis showing DNA fragments encoding MTDs-EGFP cloned into the pET 28(+) vector according to the present invention.

FIGS. 6 a and 6 b are photographs of a SDS-PAGE analysis illustrating the inducible expression of His-MTD-EGFP recombinant proteins according to the present invention.

FIGS. 7 a and 7 b are photographs of a SDS-PAGE analysis showing the purity of His-MTD-EGFP recombinant proteins under denaturing conditions according to the present invention.

FIG. 8 is a photograph of a SDS-PAGE analysis showing the purification of His-MTD-EGFP recombinant proteins under renaturing conditions according to the present invention.

FIGS. 9 a to 9 g are graphs illustrating the cell permeabilities of His-MTD-EGFP recombinant proteins analyzed by flow cytometry according to the present invention.

FIGS. 10 a to 10 i are photographs visualizing the cell permeabilities of His-MTD-EGFP recombinant proteins by confocal laser scanning microscopy according to the present invention.

FIGS. 11 a and 11 b are graphs comparing the cell permeabilities of His-MTD-EGFP recombinant proteins according to the present invention with a positive control.

FIGS. 12 a to 12 i are photographs visualizing the in vivo distribution of His-MTD-EGFP recombinant proteins by fluorescence microscopy according to the present invention.

FIGS. 13 a to 13 k show tables summarizing the characteristics of the MTD peptides identified according to the method of the present invention.

DETAILED DESCRIPTION

The present invention relates to novel macromolecule transduction domain (MTD) peptides having cell permeability that facilitate the traverse of a biologically active molecule across the cell membrane. The MTD peptides of the present invention are cell permeable polypeptides capable of mediating the import of a biologically active molecule, including polypeptides, protein domains, or full-length proteins, through the cell membrane. The MTD peptides of the present invention are characterized as having a single hydrophobic region at their N-terminus, forming a helix structure, exhibiting flexibility, and having relatively short amino acids (7 to 17 amino acids) in length (see FIGS. 1 a to 1 o).

One embodiment of the present invention relates to isolated MTD peptides having cell permeability, where the MTD peptide has an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193, analogs, derivatives, amidated variations, and conservative variations thereof.

A person having ordinary skill in the art can make similar substitutions to obtain peptides having greater cell permeability and a broader host range. For example, the present invention provides peptides corresponding to amino acid sequences SEQ ID NOS: 1 to 193, as well as analogues, derivatives, and amidated derivatives thereof, as long as the cell permeability of the peptide remains. Minor modifications to the primary amino acid sequence of the peptides of the present invention may result in peptides which have substantially equivalent or enhanced cell permeability, as compared to the specific peptides described herein. Such modifications may be deliberate, as by site-directed mutagenesis, or may be spontaneous. All of the peptides produced by these modifications are included herein, as long as the cell permeability of the original peptide still exists or, in the case of amidated versions of the peptide, the cell permeability of the original peptide is enhanced or altered such that the amidated peptide is therapeutically useful. It is envisioned that such modifications are useful for altering or enhancing the cell permeability of a particular peptide.

Further, the deletion of one or more amino acids can also result in a modification to the structure of the resultant molecule without any significant change in its cell permeability. This can lead to the development of a smaller active molecule which may also have utility. For example, amino- or carboxy-terminal amino acids which may not be required for the cell permeability of a particular peptide can be removed. The peptides of the present invention include any analog, homolog, mutant, isomer, or derivative of the peptides disclosed in the present application, so long as the cell permeability, as described herein, remains. All peptides were synthesized using L-amino acids; however, D forms of all of the peptides can be synthetically produced. In addition, C-terminal derivatives, such as C-terminal methyl esters and C-terminal amidates, can be produced, in order to increase the cell permeability of the peptide of the present invention.

A “peptide” of the present invention includes amino acid sequences that are conservative variations of those peptides specifically exemplified herein. The term “conservative variation” as used herein denotes the replacement of an amino acid residue by another, biologically similar residue. Examples of conservative variations include the substitution of one hydrophobic residue, such as isoleucine, valine, leucine, alanine, cysteine, glycine, phenylalanine, proline, tryptophan, tyrosine, norleucine, or methionine for another, or the substitution of one polar residue for another, such as the substitution of arginine for lysine, glutamic for aspartic acids, or glutamine for asparagine, and the like. Neutral hydrophilic amino acids which can be substituted for one another include asparagine, glutamine, serine, and threonine. The term “conservative variation” also includes the use of a substituted amino acid in place of an unsubstituted parent amino acid, provided that antibodies raised to the substituted polypeptide also immunoreact with the unsubstituted polypeptide. Such conservative substitutions are within the definition of the classes of the peptides of the present invention.

Another aspect of the present invention relates to isolated polynucleotides encoding the above MTD peptides of the present invention. Exemplary polynucleotides encode the MTD peptides having an amino acid sequence selected from the group consisting of SEQ ID NOS: 1 to 193 and analogs, derivatives, amidated variations, and conservative variations thereof. The polynucleotides encoding the MTD peptides of the present invention have nucleotide sequences represented by SEQ ID NOs: 194 to 386.

The polynucleotides of the present invention may be in the form of a RNA or DNA, where DNA includes cDNA and synthetic DNA. The DNA may be single stranded or double stranded. If it is single stranded, it may be the coding strand or non-coding (antisense) strand. The coding sequence may be identical to the nucleotide sequence selected from SEQ ID NOs: 194 to 386 or may be a different coding sequence, where the coding sequence, as a result of degeneracy or redundancy of the genetic code, encodes for the same polypeptide.

The polynucleotides of the present invention also include variants of the above-described polynucleotides, which encode fragments, analogs, and derivatives of the polynucleotides characterized by the deduced amino acid sequences of SEQ ID NOs: 1 to 193. The variant of the polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a non-naturally occurring variant of the polynucleotide.

The polynucleotide of the present invention may have a coding sequence which is a naturally occurring allelic variant of the coding sequence characterized by the nucleotide sequence selected from SEQ ID NOs: 194 to 386. An allelic variant is an alternate form of a polynucleotide sequence which may have a substitution, deletion, or addition of one or more nucleotides, which does not substantially alter the function of the encoded polypeptide.

Sequences similar or homologous (e.g., at least about 70% sequence identity) to the sequences disclosed herein are also part of the present invention. In other embodiments of the present invention, the sequence identity at the amino acid level can be about 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher. At the polynucleotide level, the sequence identity can be about 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher. Alternatively, substantial identity exists when the polynucleotide segments hybridize under selective hybridization conditions (e.g., very high stringency hybridization conditions), to the complement of the strand. The polynucleotides may be present in whole cells, in a cell lysate, or in a partially purified or substantially pure form.

It is well known in the art that a single amino acid may be encoded by more than one nucleotide codon and that the polynucleotide may be easily modified to produce an alternate polynucleotide that encodes the same peptide. Therefore, other embodiments of the present invention include alternate nucleotide sequences encoding peptides containing the amino acid sequences as previously described. Nucleic acid molecules encoding peptides containing the claimed amino acid sequence include nucleotide sequences which encode any combination of the claimed sequence and any other amino acids located N-terminal or C-terminal to the claimed amino acid sequence.

It is to be understood that amino acid and nucleic acid sequences of the present invention may include additional residues, particularly N- or C-terminal amino acids or 5′ or 3′ polynucleotides, and still be essentially as set forth in the sequences disclosed herein, as long as the sequence confers membrane permeability upon the polypeptide or protein moiety of the recombinant protein.

The present invention also relates to a method of identifying the MTD peptides having cell permeability.

The method of the present invention involves the following steps:

1) identifying secreted proteins having a signal sequence-like domain from multiple amino acid sequence databases;

2) selecting from said identified secreted proteins hydrophobic peptides having a single hydrophobic region at their N-terminus and forming a helix structure; and

3) optimizing and minimizing said selected hydrophobic peptides under conditions effective to produce peptides suitable for use as an MTD peptide.

In order to identify certain candidate peptides that can potentially penetrate the plasma membrane, secreted proteins that have a signal sequence-like domain are identified from multiple amino acid sequence databases, as described in step 1). In a specific embodiment of the present invention, these secreted proteins are selected from the PubMed Entrez Protein Database by using several key words, such as “hydrophobic region of signal sequence,” “signal sequence hydrophobic region,” “signal sequence of secreted protein,” “hydrophobic signal sequence,” and “hydrophobic domain of secreted protein.” As a result, more than 1,500 secreted proteins that have a signal sequence-like domain are identified.

As used herein, the term “signal sequence-like domain” refers to a peptide capable of mediating macromolecule translocation across the plasma membrane.

The term “signal peptide” as used herein is a short (3-60 amino acids long) peptide chain that directs the post-translational transport of a protein. Signal peptides may also be called targeting signals, signal sequences, transit peptides, or localization signals. The amino acid sequences of signal peptides direct proteins (which are synthesized in the cytoplasm) to certain organelles such as the nucleus, mitochondrial matrix, endoplasmic reticulum, chloroplast, apoplast and peroxisome. Some signal peptides are cleaved from the protein by signal peptidase after the proteins are transported. The N-terminal sequence of a secreted protein, which is required for transport through the cell membrane.

Examples of signal peptides may include, but are not limited to, transport to the nucleus (NLS: -Pro-Pro-Lys-Lys-Lys-Arg-Lys-Val-), transport to the endoplasmic reticulum (H₂N-Met-Met-Ser-Phe-Val-Ser-Leu-Leu-Leu-Val-Gly-Ile-Leu-Phe-Trp-Ala-Thr-Glu-Ala-Glu-Gln-Leu-Thr-Lys-Cys-Glu-Val-Phe-Gln-), retention to the endoplasmic reticulum (-Lys-Asp-Glu-Leu-COOH), transport to the mitochondrial matrix (H₂N-Met-Leu-Ser-Leu-Arg-Gln-Ser-Ile-Arg-Phe-Phe-Lys-Pro-Ala-Thr-Arg-Thr-Leu-Cys-Ser-Ser-Arg-Tyr-Leu-Leu-), transport to the peroxisome (PTS1: -Ser-Lys-Leu-COOH), transport to the peroxisome (PTS2: H₂N-Arg-Leu-X5-His-Leu-) (wherein, H₂N is the N-terminus of a protein, and COOH is the C-Terminus of a protein).

Examples of signal sequence-like domains may include, but are not limited to, penetratin (Arg-Gln-Ile-Lys-Ile-Trp-Phe-Gln-Asn-Arg-Arg-Met-Lys-Trp-Lys-Lys), the Tat peptide derived from the transactivating protein Tat of HIV-1 (Arg-Lys-Lys-Arg-Arg-Gln-Arg-Arg-Pro), transportan (Gly-Trp-Thr-Leu-Asn-Ser-Ala-Gly-Tyr-Leu-Leu-Gly-Lys-Ile-Asn-Lys-Ala-Leu-Ala-Ala-Leu-Ala-Lys-Lys-Ile-Leu), Buforin II (Thr-Arg-Ser-Ser-Arg-Ala-Gly-Leu-Gln-Phe-Arg-Val-Gly-Arg-Val-His-Arg-Leu-Leu-Arg-Lys), MAP (model amphiphatic peptide: Lys-Leu-Ala-Leu-Lys-Ala-Ser-Leu-Lys-Ala-Leu-Lys-Ala-Ala-Leu-Lys-Leu-Ala), k-FGF (Ala-Ala-Val-Ala-Leu-Leu-Pro-Ala-Val-Leu-Leu-Ala-Leu-Leu-Ala-Pro), Ku 70 (Val-Pro-Met-Leu-Lys-Pro-Met-Leu-Lys-Glu), prion (Met-Ala-Asn-Leu-Gly-Tyr-Trp-Leu-Leu-Ala-Leu-Phe-Val-Thr-Met-Trp-Thr-Asp-Val-Gly-Leu-Cys-Lys-Lys-Arg-Pro-Lys-Pro), pVEC (Leu-Leu-Ile-Ile-Leu-Arg-Arg-Arg-Ile-Arg-Lys-Gln-Ala-His-Ala-His-Ser-Lys), pep-1 (Lys-Glu-Thr-Trp-Trp-Glu-Thr-Trp-Trp-Thr-Glu-Trp-Ser-Gln-Pro-Lys-Lys-Lys-Arg-Lys-Val), SynB1 (Arg-Gly-Gly-Arg-Leu-Ser-Tyr-Ser-Arg-Arg-Arg-Phe-Ser-Thr-Ser-Thr-Gly-Arg), pep-7 (Ser-Asp-Leu-Trp-Glu-Met-Met-Met-Val-Ser-Leu-Ala-Cys-Gln-Tyr), and HN-1 (Thr-Ser-Pro-Leu-Leu-Ile-His-Asn-Gly-Gln-Lys-Leu).

Next, as described in step 2), hydrophobic peptides that have a single hydrophobic region at their N-terminus and form a helix structure are selected from the secreted proteins identified in step 1). All of the secreted proteins having a signal sequence-like domain that were identified in step 1) are subjected to hydropathy analysis to determine whether they contain a single hydrophobic region (H-region) at their N-terminus, followed by computer-aided genomic and proteomic information analysis to determine whether they form a helix structure. The hydrophobic region usually forms a helix structure, which imparts the protein with membrane-translocating activity.

In one embodiment of the present invention, the SOSUI system, a classification and secondary prediction system for membrane proteins, can be used for the computer-aided genomic and proteomic information analysis. The SOSUI system is a useful tool for secondary structure prediction of membrane proteins from a protein sequence and is freely available on-line (see http://bp.nuap.nagoya-u.ac.jp/sosui). Since the SOSUI system currently requires peptides to be analyzed to have at least about 20 amino acids in length, additional amino acids derived from the N-terminal domain of enhanced green fluorescent protein are added to the end of the signal sequence-like domain. When the SOSUI system is used, 220 hydrophobic peptides that have a single hydrophobic region at their N-terminus and form a transmembrane helix structure are selected.

Finally, the hydrophobic peptides selected in step 2) are optimized and minimized so that they are suitable for use as an MTD peptide, as described in step 3). To optimize the selected hydrophobic peptides, empirical modifications are made to them as follows: i) hydrophilic, nonpolar, and positively or negatively charged amino acids are removed from the selected hydrophobic peptides; and ii) various combinations of five hydrophobic amino acids, i.e., alanine (Ala or A), valine (Val or V), proline (Pro or P), leucine (Leu or L), and isoleucine (Ile or I) are created empirically. Said hydrophobic amino acids are known to provide flexibility to the H-region which may allow the hydrophobic region of the signal sequences to form a hairpin-like loop, resulting in destabilized conformational change of the phospholipid bilayer. Such local and transient formation of the non-bilayer lipid structure induced by contact with the flexible H-region leads to topological transformation.

To minimize the length of the hydrophobic peptides optimized as described above, the following principles are employed for determining which amino acids should be eliminated therefrom: i) non-hydrophobic amino acids at the left or the right side of the hydrophobic peptides are eliminated to minimize the size, while maintaining the hydrophobic region thereof; ii) non-hydrophobic amino acids at the middle or the right side of the hydrophobic peptides are replaced with a hydrophobic amino acid, proline (Pro or P) to provide flexibility or a bending potential; and iii) the number of repeated hydrophobic amino acids is minimized to reduce the size of the hydrophobic peptides. Thus, the MTD candidates are characterized as modified peptides exhibiting hydrophobicity and flexibility, having relatively short amino acids (7 to 17) in length, and forming a helix structure.

According to the method of the present invention described above, novel MTD peptides having an amino acid sequence selected from the group consisting of SEQ ID NOS: 1 to 193 and capable of facilitating the traverse of biologically active molecules across the cell membrane can be identified.

In another embodiment of the present invention, one of the novel MTDs, JO-98 (SEQ ID NO: 98) can be identified by the following process:

i) immunoglobulin domain containing 4 from Homo sapiens is selected from the PubMed Entrez Protein Database as a secreted protein having a signal sequence-like domain;

ii) through a hydropathy analysis, it is confirmed that said protein contains a single N-terminal hydrophobic region;

iii) the hydrophobic region (H-region) is selected from the full-length signal sequences of said protein, which has the amino acid sequence represented by MGILLGLLLLGHLTVDTY

GRPIL (23 amino acids in length);

iv) through a structural analysis utilizing the SOSUI system, it is confirmed that the selected hydrophobic region forms a helix structure;

v) non-hydrophobic amino acids (methionine: M; glycine: G) at the left side of the hydrophobic region are deleted to make the first amino acid of the hydrophobic region a hydrophobic amino acid, hydrophobic peptide that has an amino acid sequence represented by ILLGLLLLGHLTVDTYGRPIL (21 amino acids in length);

vi) basic (histidine: H; arginine: R) and hydrophilic (threonine: T; aspartic acid: D; tyrosine: Y) amino acids are removed from the hydrophobic region to reduce the peptide length to no more than 15 amino acids, resulting in a peptide that has an amino acid sequence represented by ILLGLLLLGLVGPIL (15 amino acids in length);

vii) the structural analysis shows that the modified peptide, as described above, forms a helix structure;

viii) in order to impart flexibility to the peptide, nonpolar amino acids (glycine: G) at the middle and the right sides of the peptide are replaced with a hydrophobic amino acid, proline (P), respectively, resulting in a peptide has an amino acid sequence represented by ILLPLLLLGLVPPIL (15 amino acids in length); and

ix) some of the repeated hydrophobic amino acids at the far right side of the peptide sequence are deleted to shorten the peptide length to 10 or less amino acids.

Therefore, the finally optimized peptide as a macromolecule transduction domain has an amino acid sequence represented by ILLPLLLLP (9 amino acids in length; SEQ ID NO: 98) and forms a helix structure, which has been designated as “JO-98.” The last amino acid, proline, of the JO-98 peptide is a flexible site for interaction with a biologically active molecule to form a recombinant protein.

A further aspect of the present invention relates to a method of genetically engineering a biologically active molecule to have cell permeability by using the MTD peptides.

The therapeutic use of biologically active molecules is often hindered by their low cell permeability. Although biologically active molecules have been shown to be taken up by cells via an endocytic process, the molecules that enter the cell in this manner are usually trapped in endocytic vesicles and degraded in lysosomes.

The MTD of the present invention provides efficient transport of biologically active molecules having a high molecular weight across the cell membrane, whereas other membrane transport peptides previously tested have not been shown to transport molecules of a size greater than approximately 25 amino acids. The MTD of the present invention can be attached to a peptide or polypeptide using conventional methods in the art to enhance the peptide's or polypeptide's membrane permeability. The MTD can be provided in the form of a kit, which includes the necessary components known to those skilled in the art to facilitate linkage of a peptide to a target polypeptide. A target protein attached to the MTD in this manner can then be delivered to the cell either in vitro or in vivo for intracellular import.

According to the method of the present invention, the polynucleotide described above can be inserted into a protein expression vector to produce a protein which can be imported from the exterior to the interior of a cell by the action of the MTD peptides described herein.

An expression vector is genetically engineered to incorporate a nucleic acid sequence encoding a MTD in an orientation either N-terminal and/or C-terminal to a nucleic acid sequence encoding a peptide, polypeptide, protein domain, or full-length protein of interest as a biologically active molecule, and in the correct reading frame so that a recombinant protein consisting of the macromolecule transduction domain and the target biologically active molecule may be expressed. Expression vectors may be chosen from among those readily available for use in prokaryotic or eukaryotic expression systems.

As used herein, an MTD is a macromolecule transduction domain of the present invention, which directs cellular transport of a target protein from the exterior to the interior of a cell. In another embodiment of the present invention, the MTD may comprise an alternate sequence which mediates the import of a peptide or polypeptide through the cell membrane to the interior of a cell.

A target protein is a protein which normally exhibits less than optimal permeability through the cell membrane, but which, when attached either N-terminal and/or C-terminal to an MTD of the present invention, is transported from the exterior to the interior of the cell.

In another embodiment of the present invention, a cleavage site is located between the MTD and the target polypeptide, protein domain, or full-length protein. This site may alternatively be a factor X site or another site that is known to those skilled in the art to relate to the cleavage of the recombinant protein to physically remove the MTD from the subject peptide or polypeptide.

The method of the present invention provides a means for producing proteins having cell permeability for introduction into the interior of the cell, where their actions help to further elucidate cellular control and biosynthesis mechanisms. This method also provides a means for introducing intracellular proteins into cells to produce targeted cellular changes, such as the inhibition of apoptosis by the introduction of Bc1-2. Cell cycle control, for example, can be altered by the introduction of a functional p53 protein product into those cells that have become tumorigenic due to an abnormal p53 protein.

As used herein, the term “biologically active molecule” includes any molecule which, if imported into a cell, is capable of exhibiting a biological effect. Since very large proteins having molecular weights ranging from about 100,000 to about 1 million are exported by cells (e.g., antibodies, fibrinogen, and macroglobulin), very large proteins can be imported into cells by the method of the present invention. Therefore, proteins having sizes ranging from a few amino acids to about a thousand amino acids can be used. A preferable size range for proteins is from a few amino acids to about 250 amino acids. For any molecule, the size can be up to a molecular weight of about 1 million, specifically up to a molecular weight of about 25,000, and more specifically up to a molecular weight of about 3,000. In addition, only those molecules which can be attached to an MTD peptide as a signal peptide, either directly or indirectly, are within the scope of the present invention.

Examples of biologically active molecules include, but are not limited to, proteins, polypeptides, and peptides, which include functional domains of biologically active molecules, such as growth factors, enzymes, transcription factors, toxins, antigenic peptides (for vaccines), antibodies, and antibody fragments. Additional examples of biologically active molecules include nucleic acids, such as plasmids, coding nucleic acid sequences, mRNAs and antisense RNA molecules, carbohydrates, lipids, and glycolipids. Further examples of biologically active molecules include therapeutic agents, in particular those with low cell membrane permeability. Some examples of these therapeutic agents include cancer drugs, such as Daunorubicin, and toxic chemicals which, because of the lower dosage that can be used when administered by this method, can now be more safely administered. Yet another example of a biologically active molecule is an antigenic peptide. Antigenic peptides can be administered to provide immunological protection when imported by cells involved in the immune response. Other examples include immunosuppressive peptides (e.g., peptides that block autoreactive T cells, which are known in the art). Numerous other examples will be apparent to the skilled artisan.

Representative examples of biologically active molecules suitable for the present invention may include enzymes, hormones, transport proteins, immunoglobulin or antibodies, structural proteins, motor function proteins, receptors, signaling proteins and storage proteins in terms of their function; and membrane or transmembrane proteins, internal proteins, external or secret proteins, virus proteins, native proteins, glicoproteins, cleaved proteins, proteins with disulfide bonds, protein complexes, chemically modified proteins and prions in terms of their location and roles.

Standard recombinant nucleic acid methods can be used to express a genetically engineered recombinant protein. In one embodiment of the present invention, a nucleic acid sequence encoding the macromolecule transduction domain is cloned into a nucleic acid expression vector, e.g., with appropriate signal and processing sequences and regulatory sequences for transcription and translation. In another embodiment, the protein can be synthesized using automated organic synthetic methods. Synthetic methods for producing proteins are described, for example, in Methods in Enzymology, Volume 289: Solid-Phase Peptide Synthesis by Gregg B. Fields (Editor), Sidney P. Colowick, Melvin I. Simon (Editor), Academic Press (1997).

In order to obtain high level expression of a cloned gene or nucleic acid, such as a cDNA encoding a MTD peptide, a MTD sequence is typically subcloned into an expression vector that contains a strong promoter for directing transcription, a transcription/translation terminator, and, in the case of a nucleic acid encoding a protein, a ribosome binding site for translational initiation. Suitable bacterial promoters are well known in the art and are described, e.g., in Sambrook & Russell, Molecular Cloning: A Laboratory Manual, 3d Edition, Cold Spring Harbor Laboratory, N.Y. (2001) and Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley Interscience, N.Y. (1989). Bacterial expression systems for expressing the MTD peptide of the present invention are available in, e.g., E. coli, Bacillus sp., and Salmonella (Palva et al., Gene 22: 229-235 (1983); Mosbach et al., Nature 302: 543-545 (1983)). Kits for such expression systems are commercially available. Eukaryotic expression systems for mammalian cells, yeast, and insect cells are well known in the art and are also commercially available. In another embodiment of the present invention, the eukaryotic expression vector is an adenoviral vector, an adeno-associated vector, or a retroviral vector.

The expression vector for expressing the cell permeable recombinant protein can include regulatory sequences, including for example, a promoter, operably attached to a sequence encoding the macromolecule transduction domain. Non-limiting examples of inducible promoters that can be used include steroid-hormone responsive promoters (e.g., ecdysone-responsive, estrogen-responsive, and glutacorticoid-responsive promoters), the tetracycline “Tet-On” and “Tet-Off” systems, and metal-responsive promoters. The construct can be introduced into an appropriate host cell, e.g., a bacterial cell, yeast cell, insect cell, or tissue culture cell. The construct can also be introduced into embryonic stem cells to generate a transgenic organism as a model subject. Large numbers of suitable vectors and promoters are known to those skilled in the art and are commercially available for generating the recombinant constructs of the present invention.

Known methods can be used to construct vectors containing the polynucleotide of the present invention and appropriate transcriptional/translational control signals. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo recombination/genetic recombination. See, for example, the techniques described in Sambrook & Russell, Molecular Cloning: A Laboratory Manual, 3d Edition, Cold Spring Harbor Laboratory, N.Y. (2001) and Ausubel et al., Current Protocols in Molecular Biology Greene Publishing Associates and Wiley Interscience, N.Y. (1989).

Host cells suitable for producing a cell permeable recombinant protein include bacterial cells and eukaryotic cells (e.g., fungal, insect, plant, and mammalian cells). Host cells can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or the use of cell lysing agents. Scopes, Protein Purification: Principles and Practice, New York: Springer-Verlag (1994) describes a number of general methods for purifying recombinant (and non-recombinant) proteins. The methods can include, e.g., ion-exchange chromatography, size-exclusion chromatography, affinity chromatography, selective precipitation, dialysis, and hydrophobic interaction chromatography. These methods can be adapted to devise a purification strategy for the cell permeable recombinant protein. If the cell permeable recombinant protein includes a purification handle, such as an epitope tag or a metal chelating sequence, affinity chromatography can be used to purify the protein greatly.

The amount of protein produced can be evaluated by detecting the macromolecule transduction domain directly (e.g., using Western analysis) or indirectly (e.g., by assaying materials from, the cells for specific DNA binding activity, such as by electrophoretic mobility shift assay). Proteins can be detected prior to purification, during any stage of purification, or after purification. In some implementations, purification or complete purification may not be necessary.

In a specific embodiment of the present invention, the expression vector is pEGFP-C1 (Clontech, Mountain View, Calif.), which comprises a polynucleotide encoding a recombinant protein including an enhanced green fluorescent protein (EGFP). The insertion of a polynucleotide encoding an MTD according to the present invention into vector pEGFP-C1, 5′ and/or 3′ to the EGFP gene of the vector enables the expression of a recombinant protein incorporating both the MTD and the EGFP. The MTD, connected N-terminal and/or C-terminal to the EGFP, carries the EGFP protein to the interior of the cell.

The genetically engineered recombinant proteins prepared by the method of the present invention are cell permeable proteins and can be used as protein-based vaccines, particularly where killed or attenuated whole organism vaccines are impractical. The cell permeable proteins prepared by the method of the present invention can also be used for the treatment of diseases, particularly cancer. Cell permeable proteins can be delivered to the interior of the cell, eliminating the need to transfect or transform the cell with a recombinant vector. The cell permeable proteins of the present invention can be used in vitro to investigate protein function or can be used to maintain cells in a desired state.

The MTD of the present invention can be used to deliver peptides, polypeptides, protein domains, or proteins to the interior of a target cell either in vitro or in vivo. The MTD can be attached to the target protein through a peptide linkage formed by the expression of the recombinant protein from a recombinant DNA or RNA molecule or can be attached to the target protein by means of a linker covalently attached to the MTD. A covalent linkage can be used to attach the MTD of the present invention to a non-protein molecule, such as a polynucleotide, for import into the cell.

Another aspect of the present invention relates to a pharmaceutical composition comprising the cell permeable recombinant proteins of the present invention, where the MTD peptide is operably attached to a biologically active molecule.

The cell permeable recombinant protein produced by the method of the present invention may be administered in vitro by any of the standard methods known to those skilled in the art, such as by addition of the recombinant protein to culture medium. Furthermore, it will be appreciated by those skilled in the art that recombinant proteins produced by this method may be delivered in vivo by standard methods utilized for protein/drug delivery, including parenteral administration, intravenous administration, topical administration, aerosol administration or inhalation, oral administration (particularly when provided in an encapsulated form), or by rectal or vaginal administration (particularly when provided in a suppository form).

Examples of administration include, but are not limited to, parenteral administration, e.g., by intravenous injection including regional perfusion, inhalation of an aerosol, subcutaneous or intramuscular injection, topical administration, such as to skin wounds and lesions, direct transfection into, e.g., bone marrow cells prepared for transplantation and subsequent transplantation into the subject, and direct transfection into an organ that is subsequently transplanted into the subject. Parenteral administration, e.g., regional perfusion, if used, is generally characterized by injection. Injectables can be prepared in conventional forms, such as liquid solutions, suspensions, or emulsions. A slow release or sustained release system can also be used, allowing the maintenance of a constant level of dosage.

Further administration methods include oral administration, particularly when the complex is encapsulated, or rectal administration, particularly when the complex is in a suppository form. A pharmaceutically acceptable carrier includes any material that is not biologically or otherwise undesirable, i.e., the material may be administered to an individual along with the selected complex without causing any undesirable biological effects or interacting in a deleterious manner with any of the other components of the pharmaceutical composition in which it is administered.

Depending on the intended mode of administration (e.g., including, but not limited to, intravenous, parenteral, transcutaneous, subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic, intraventricular, intracapsular, intraspinal, intracisternal, intraperitoneal, intranasal, intrarectal, intravaginal, aerosol, or oral), the pharmaceutical compositions may be in the form of solid, semi-solid or liquid dosage forms, such as, for example, tablets, suppositories, pills, capsules, powders, liquids, suspensions, lotions, creams, gels, or the like, preferably in unit dosage form suitable for single administration of a precise dosage. The pharmaceutical compositions of the present invention will include, as noted above, an effective amount of the cell permeable recombinant protein in combination with a pharmaceutically acceptable carrier and, in addition, may include other medicinal agents, pharmaceutical agents, carriers, adjuvants, diluents, etc.

For solid compositions, conventional nontoxic solid carriers include, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharin, talc, cellulose, glucose, sucrose, magnesium carbonate, and the like. Liquid pharmaceutically administrable compositions can, for example, be prepared by dissolving or dispersing an active compound as described herein, and optional pharmaceutical adjuvants in an excipient, such as, for example, water, saline, aqueous dextrose, glycerol, ethanol, and the like, to thereby form a solution or suspension. If desired, the pharmaceutical composition to be administered may also contain minor amounts of nontoxic auxiliary substances, such as wetting or emulsifying agents, pH buffering agents, and the like. Actual methods of preparing such dosage forms are known, or will be apparent, to those skilled in the art.

Administration of a cell permeable recombinant protein produced by the method of the present invention may be performed for a period of time ranging from 10 minutes to 72 hours, particularly when the administration is carried out by the addition of recombinant protein to culture media for in vitro use. For in vivo or in vitro use, the effective administration time for a cell permeable recombinant protein produced by the method of the present invention may be readily determined by those skilled in the relevant art.

For either in vitro or in vivo use, the cell permeable recombinant protein can be administered at any effective concentration. An effective concentration is that amount that results in the importation of the biologically active molecule into the cell. Such a concentration—culture medium concentration (in vitro) or blood serum concentration (in vivo)—will typically be from about 0.1 nM to about 500 μM. Optimal concentrations for the specific recombinant protein and/or the specific target cell can be readily determined according to the teachings herein. Thus, the in vivo dosage for the recombinant protein is the concentration which will cause the blood serum concentration of the recombinant protein to be, about 0.1 nM to about 500 μM, more specifically, about 0.5 nM to about 100 μM. The amount of the recombinant protein administered will, of course, depend upon the subject being treated, the subject's age and weight, the manner of administration, and the judgment of the skilled administrator. The exact amount of the complex will further depend upon the general condition of the subject, the severity of the disease/condition being treated by the administration, and the particular complex chosen. However, the appropriate amount can be determined by one of ordinary skill in the art, using routine optimization, given the teachings provided herein.

Further, the cell permeable recombinant proteins produced by the method of the present invention can be used for vaccine administration (Lingnan et al., Expert. Rev. Vaccines 6: 741-746, 2007; O'Hagan et al., Methods 40: 10-19, 2009).

Vaccines using cell permeable recombinant proteins provide an advantage over the typical peptide vaccines. The immune system recognition of an antigen depends upon appropriate antigen processing. Previously, entire proteins or protein domains could not be delivered to the interior of the cell for processing to occur. As a consequence, peptides representing antigenic epitopes had to be identified prior to the delivery to the cell of small peptides representing those epitopes. The method of the present invention allows whole proteins or protein domains to be imported into the cell, where antigenic processing can occur. This provides multiple antigenic epitopes in one administration, and eliminates the need for experimental identification of specific epitopes for vaccine development.

Typically, such vaccines are prepared for injection into a human or mammalian subject. Injectable vaccines can be prepared as liquid solutions or suspensions. Solid forms can be prepared which are suitable for solution in, or suspension in, a liquid, prior to injection. The preparation may also be emulsified. The active immunogenic ingredient is often mixed with a pharmaceutically acceptable carrier which is compatible with the active ingredient. Suitable carriers include, but are not limited to, water, dextrose, glycerol, saline, ethanol, and combinations thereof. The vaccine may contain additional agents, such as wetting or emulsifying agents, pH buffering agents, or adjuvants which enhance the effectiveness of the vaccine.

The vaccine may be conventionally administered parenterally. Either subcutaneous or intramuscular injection is appropriate. Other modes of administration may include oral administration, nasal administration, rectal administration, and vaginal administration, which may involve combining the peptide immunogen with pharmaceutically acceptable carriers, such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, or other carriers. Compositions for oral administration may form solutions, suspensions, tablets, pills, capsules, sustained release formulations, or powders. A protein-based vaccine of the present invention can be administered by an enteric-coated capsule for release of the polypeptide into the lumen of the intestine.

The cell permeable recombinant proteins of the present invention may be formulated into the vaccine as neutral or salt forms. Pharmaceutically acceptable salts include the acid addition salts (formed with the free amino groups of the polypeptide) and which are formed with inorganic acids, such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, mandelic, oxalic, and tartaric acids. Salts formed with the free carboxyl groups of the polypeptide may also be derived from inorganic bases such as, for example, sodium, potassium ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, and histidine.

The vaccine is administered in a manner compatible with the dosage formulation, and in such an amount as will be therapeutically effective and immunogenic. The quantity to be administered depends on the subject to be treated, taking into account, for example, the capacity of the individual's immune system to synthesize antibodies and the degree of protection desired. The precise amounts of the active ingredient (peptide immunogen) to be administered depend on the judgment of the practitioner. Suitable dosage ranges generally require several hundred micrograms of active ingredient per vaccination. Also variable are the regimes for initial administration and booster vaccinations, which should be determined according to the judgment of the practitioner. The dosage of the vaccine will depend on the route of administration and will vary according to the size of the host.

Furthermore, the cell permeable recombinant proteins of the present invention can be effectively used in drug delivery systems (Spencer and Verma, Proc. Natl. Acad. Sci. USA 104: 7594-7599, 2007; Choi et al., Nat. Med. 12: 574-579, 2006; Vives et al., J. Biol. Chem. 272: 16010-16017, 1997; Kumar et al., Nature 448: 39-43, 2007; Jo et al., Nat. Med. 11: 892-898, 2005).

The method of genetically engineering proteins with cell membrane permeability according to the present invention provides a means for delivering therapeutic protein products into a cell. Combination of the present invention with previously described methods of extracellular protein delivery provides a method of delivering proteins for import into a cell in a stabilized, functional form in a controlled-release fashion.

Polypeptides are produced using an appropriate expression vector and an expression system. Cell permeability is conferred upon the protein or polypeptide by the expression of a recombinant protein with the MTD located N-terminal and/or C-terminal to the expressed polypeptide. The less stable proteins are stabilized by methods known to those skilled in the art and described previously. Delivery to the extracellular environment is accomplished by providing the stabilized recombinant protein in an appropriate carrier, such as microsphere carriers. The protein of choice will dictate the appropriate vector and expression system, as well as the appropriate stabilization and delivery techniques. A person of ordinary skill in the art of drug delivery systems can choose the appropriate techniques from among those described.

The method of the present invention provides a means for producing cell permeable proteins for the treatment of cancer. Regulators of apoptosis and cell cycle control have been found to play a key role in oncogenesis, while gene therapy techniques (Balaggan et al., Gene Therapy 13(15): 1153-1165, 2006; Kuo et al., Proc. Natl. Acad. Sci. USA 10; 98(8): 4605-4610, 2001), using intratumoral injection of an adenoviral expression vector encoding the p53 gene, have shown promise for the control of some tumors. However, the delivery of specific protein products through the use of viral vectors has proven to be problematic. The MTDs and methods of the present invention provide a means for producing cell permeable proteins from among the cell cycle regulators and regulators of apoptosis, as well as other proteins identified to play a role in the development of the cancer state.

For example, in one embodiment of the present invention, the polynucleotide encoding the p53 gene can be inserted into a suitable vector, either 5′ or 3′ to the sequence of the MTD of the present invention. Under expression conditions appropriate for the vector of choice and known to those skilled in the art, a recombinant protein comprising an MTD and the p53 protein can be expressed. The attachment of the MTD to the p53 protein renders the p53 protein cell permeable, and protein can be administered to tumor cells to inhibit tumor development. Administration of cell permeable proteins can be accomplished in various ways, including, but not limited to, intratumoral injection, infusion, and intravenous administration. Bax and Bcl-x_(L) are other examples of proteins from among a wide variety of proteins that have been determined to affect cell cycle control and apoptosis and, therefore, be effective for cancer therapy. The method of the present invention provides a more efficient, less labor-intensive, and less costly method for the delivery of anti-oncogenic proteins to tumor cells.

The term “cell membrane” as used herein refers to a lipid-containing barrier which separates cells or groups of cells from the extracellular space. Cell membranes include, but are not limited to, plasma membranes, cell walls, intracellular organelle membranes, such as the mitochondrial membrane, nuclear membranes, and the like.

The term “biologically active molecule” as used herein refers to compounds or molecules that are capable of eliciting or modifying a biological response in a system. Non-limiting examples of biologically active molecules include antibodies (e.g., monoclonal, chimeric, humanized etc.), cholesterol, hormones, antivirals, peptides, proteins, chemotherapeutics, small molecules, vitamins, co-factors, nucleosides, nucleotides, oligonucleotides, enzymatic nucleic acids, antisense nucleic acids, triplex forming oligonucleotides, 2,5-A chimeras, siNA, siRNA, miRNA, RNAi inhibitors, dsRNA, allozymes, aptamers, decoys and analogs thereof. Biologically active molecules of the invention also include molecules capable of modulating the pharmacokinetics and/or pharmacodynamics of other biologically active molecules, for example, lipids and polymers such as polyamines, polyamides, polyethylene glycol and other polyethers. In certain embodiments, the term biologically active molecule is used interchangeably with the term “macromolecule”.

The term “macromolecule” as used herein refers to large molecules (molecular weight greater than 1000 daltons) exemplified by, but not limited to, peptides, proteins, and oligonucleotides and polynucleotides of biological or synthetic origin.

The term “peptide” as used herein refers to a compound made up of a single chain of D- or L-amino acids or a mixture of D- and L-amino acids joined by peptide bonds. Preferably, peptides contain at least two amino acid residues and are less than about 50 amino acids in length.

The term “protein” as used herein refers to a compound that is composed of linearly arranged amino acids attached by peptide bonds, but in contrast to peptides, has a well-defined conformation. Proteins, as opposed to peptides, preferably contain chains of 50 or more amino acids.

The term “polypeptide” as used herein refers to a polymer of at least two amino acid residues and which contains one or more peptide bonds. Polypeptides encompass peptides and proteins, regardless of whether the polypeptide has a well-defined conformation.

The term “nucleic acid” as used herein refers to oligonucleotides or polynucleotides, such as deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), as well as analogs of either RNA or DNA, for example made from nucleotide analogs any of which are in single or double stranded form.

Amino acid residues are herein referred to by their standard single-letter or three-letter notations or by their full names: A, Ala, alanine; C, Cys, cysteine; D, Asp, aspartic acid; E, Glu, glutamic acid; F, Phe, phenylalanine; G, Gly, glycine; H, His, histidine; I, Ile, isoleucine; K, Lys, lysine; L, Leu, leucine; M, Met, methionine; N, Asn, asparagine; P, Pro, proline; Q, Gln, glutamine; R, Arg, arginine; S, Ser, serine; T, Thr, threonine; V, Val, valine; W, Trp, tryptophan; X, Hyp, hydroxyproline; and Y, Tyr, tyrosine.

The term “macromolecule transduction domain (MTD)” as used herein refers to a peptide that facilitates the transport of macromolecules into a cell.

The term “isolated” polynucleotide as used herein refers to a polynucleotide that is substantially free of the sequences with which it is associated in nature. By substantially free is meant at least 50%, preferably at least 70%, more preferably at least 80%, and even more preferably at least 90% free of the materials with which it is associated in nature. As used herein, an “isolated” polynucleotide also refers to recombinant polynucleotides, which, by virtue of origin or manipulation: (1) are not associated with all or a portion of a polynucleotide with which it is associated in nature, (2) are linked to a polynucleotide other than that to which it is linked in nature, or (3) does not occur in nature.

The term “operatively attached” as used herein means that DNA fragments of a promoter, a MTD peptide or other genes are sufficiently connected to direct and regulate the expression of genes.

The term “homology” or “similarity” as used herein refers to the likeness or the percentage of identity between two sequences (e.g., polynucleotide or polypeptide sequences). Typically, a higher similarity in sequences means more similarity in the physical or chemical properties or biological activities.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any method and material similar or equivalent to those described herein can also be used in the practice or testing of the present invention, specific methods and materials are now described. All publications mentioned herein are incorporated herein by reference in their entirety to disclose and describe the methods and/or materials in connection with which the publications are cited.

It must be noted that as used herein and in the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise.

The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided may be different from the actual publication dates which may need to be independently confirmed.

EXAMPLES

The following examples are presented to aid practitioners of the invention, provide experimental support for the invention, and to provide model protocols. In no way are these examples to be understood to limit the invention.

Example 1 Identification of Novel Macromolecule Transduction Domain Peptides

In order to identify novel macromolecule transduction domain (MTD) candidates that can potentially penetrate the plasma membrane of live cells, secreted proteins having a signal sequence-like domain were selected from the PubMed Entrez Protein Database by using several key words, including “hydrophobic region of signal sequence,” “signal sequence hydrophobic region,” “signal sequence of secreted protein,” “hydrophobic signal sequence,” and “hydrophobic domain of secreted protein.” As a result, more than 1,500 secreted proteins having a signal sequence-like domain were selected.

All of the selected secreted proteins having a signal sequence-like domain were subjected to hydropathy analysis to determine whether they contain a single hydrophobic region at their N-terminus. Subsequently, they were subjected to a computer-aided genomic and proteomic information analysis, i.e., the SOSUI system (as described on http://bp.nuap.nagoya-u.ac.jp/sosui), to determine whether they form a helix structure. Since the SOSUI system currently requires peptides to be analyzed to have at least about 20 amino acids in length, additional amino acids derived from the N-terminal domain of enhanced green fluorescent protein were added to the end of the signal sequence-liked domain. When the SOSUI system is used, 220 peptide sequences having a single hydrophobic region at their N-terminus and forming a transmembrane helix structure were selected.

To optimize the selected peptide sequences as a signal sequence, empirical modifications were made to them as follows: first, hydrophilic, nonpolar, and positively or negatively charged amino acids were removed from the selected peptide sequences and, then, various combinations of five hydrophobic amino acids (i.e., alanine: A, valine: V, proline: P, leucine: L, isoleucine: I) were created empirically.

After that, non-hydrophobic amino acids at the left or right side of the peptide sequences were eliminated to minimize the size while maintaining their hydrophobic region, followed by adding or replacing non-hydrophobic amino acids at the middle or right side of the peptide sequences with proline, to thereby provide flexibility to the peptide. Finally, the number of repeated hydrophobic amino acids was minimized to reduce the length of the peptide sequence. The resulting peptides as MTD candidates exhibited hydrophobicity and flexibility, have relatively short amino acids (7 to 17 in length), and form a helix structure.

According to the method described above, novel 193 MTD peptides having amino acid sequences represented by SEQ ID NOs: 1 to 193, respectively, and are capable of facilitating the traverse of biologically active molecules across the cell membrane were identified. The 193 MTDs were designated as JO-001 to JO-193, respectively, and their structural characteristics and sequences are shown in FIGS. 1 a to 1 o.

Example 2 Expression of Recombinant Proteins Fused to MTDs

Having demonstrated the feasibility of the cellular import of MTD peptides identified in Example 1 above, the enhanced green fluorescent protein (EGFP) was used as a full-length protein cargo molecule. Green fluorescent protein (GFP) is a protein cloned from the jellyfish, Aquorea Victoria. GFP is one of the most widely used reporter proteins and produces a green light when illuminated with blue or UV light (Inouye et al., FEBS Letters 351(2): 211-14 (1994)). In one embodiment of the present invention, a commercially available GFP expression vector, pEGFP-C1 (Clontech), is used. The EGFP protein encoded by pEGFP-C1 is a mutant of wild-type GFP, which has been modified to produce a stronger green light. Further, when a foreign gene is inserted into the multiple cloning sites of the pEGFP-C1 vector, the inserted foreign gene is expressed in the form of a recombinant protein with EGFP.

In order to construct expression vectors for EGFP proteins fused to each novel MTD (His-MTD-EGFP: HME)(FIG. 2), a polymerase chain reaction (PCR) was conducted by using the oligonucleotides described in Tables 1 and 2 below as a primer set (forward primers SEQ ID NOs: 393 to 585 for MTD JO-1 to JO-193, respectively; reverse primer SEQ ID NO: 586 for MTD JO-1 to JO-193) and the pEGFP-C1 plasmid as a template. The conditions for the PCR were 30 cycles of 45 seconds at 95° C., 45 seconds at 68, and 1 minute at 72° C. after initial denaturation of 5 minutes at 95, and final extension of 5 minutes at 72° C. The amplified PCR product was digested with restriction enzyme NdeI and subjected to agarose gel electrophoresis. An MTD-derived from kFGF-4 16 mer (FGF-V, AAVALLPAVLLALLAP, Lin et al., J. Biol. Chem. 275: 16774-16778, 2000; Veach et al., J. Biol. Chem. 279: 11425-11431, 2004), an MTD-derived from kFGF-4 12 mer (FGF-J, AAVLLPVLLAAP, Jo et al., Nat. Biotech. 19: 929-933, 2001; Jo et al., Nat. Med. 11: 892-898, 2005(SEQ ID NO: 389), a protein transduction domain derived from HIV-Tat (HIV-Tat, YGRKKRRQRRR, Schwarze et al., Science 285: 1569-1572, 1999), and HIV-Tat homologous (Hph-1, YARVRRRGPRR) were used as positive controls, and a scramble peptide (SEQ ID NO: 387) was used as a negative control. As shown in FIGS. 3 a to 3 c, it was confirmed that the DNA fragments encoding MTDs-EGFP were successfully amplified.

TABLE 1 PCR forward primer sequences for each MTD-EGFP protein SEQ ID MTD Sequence NO JO-01 5′-CCGCATATGGCGGTGGTGGTGTGCGCGATTGTGC 393 TGGCGGCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-02 5′-CCGCATATGCCGGTGGCGCTGCTGGTGCTGGTGC 394 TGCTGGGCCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-03 5′CCGGATATGCTGCTGCTGGCGTTTGCGCTGCTGTG 395 CCTGGCGGTGAGCAAGGGCGAGGAGGTG-3 JO-04 5′-CCGCATATGCTGCTGGGCGCACTGGCGGCGGTGC 396 TGCTGGCGCTGGCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-05 5′-CCGCATATGCCGGTGCTGCTGGCGCTGGGCGTGG 397 GCCTGGTGCTGCTGGGCCTGGCGGTGGTGAGCAAGGG CGAGGAGCTG-3′ JO-06 5′-CCGCATATGGCGGCGGCGGCGGTGCTGCTGGCGG 398 CGGTGAGCAAGGGCGAGGAGCTG-3′ JO-07 5-CCGCATATGATTGTGGTGGCGGTGGTGGTGATTGT 399 GAGCAAGGGCGAGGAGCTG-3′ JO-08 5′-CCGCATATGGCGGTGCTGGGGCCGGTGGTGGCGG 400 TGGTGAGCAAGGGCGAGGAGCTG-3′ JO-09 5′-CCGGATATGCTGGCGGTGTGCGGCCTGCCGGTGG 401 TGGCGCTGCTGGCGGTGAGCAAGGGCGAGGAGGT G-3′ JO-10 5′-CCGCATATGCTGGGCGGCGCGG348TGGTGGCGG 402 CGCCGGTGGCGCGGGGGTGGCGGCGGTGAGCAAGGGC GAGGAGCTG-3′ JO-11 5′-CCGCATATGGTGCTGCTGGTGCTGGCGGTGCTGC 403 TGGCGGTGCTGGCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-12 5′-CCGCATATGCTGCTGATTCTGCTGCTGCTGCCGC 404 TGCTGATTGTGGTGAGCAAGGGCGAGGAGCTG-3 JO-13 5′-CCGCATATGGTGGCGGCGGCGGCGCTGGCGGTGC 405 TGCCGCTGGTGAGGAAGGGCGAGGAGCTG-3′ JO-14 5′-CCGCATATGTTTCTGATGCTGCTGCTGCCGCTGC 406 TGCTGCTGCTGGTGGCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-15 5′-GCGGATATGGCGGCGGCGGCGGCGGCGCTGGGTC 407 TGGGGGCGGCGGTGCCGGGGGTGAGCAAGGGCGAGGA GCTG-3′ JO-16 5′-CCGCATATGCTGCTGCTGGCGGCGCTGCTGCTGA 408 TTGCGTTTGCGGCGGTGGTGAGCAAGGGCGAGGAGCT G-3′ JO-17 5′-GCGCATATGGCGCTGGCGGCGGTGGTGCTGATTC 409 CGCTGGGCATTGCGGCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-18 5′-CCGCATATGGCGGCGCTGATTGGCGCGGTGCTGG 410 CGCCGGTGGTGGCGGTGGTGAGCAAGGGCGAGGAGCT G-3′ JO-19 5′-CCGCATATGGCGGCGGCGGTGGCGGTGGCGGGCC 411 TGGCGGCGCTGGCGCTGGTGAGCAAGGGCGAGGAGCT G-3′ JO-20 5′-CCGCATATGATTGCGGTGGCGATTGCGGCGATTG 412 TGCCGCTGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-21 5′-CCGCATATGGCGGCGGCGGCGGTGCTGGCGGCGC 413 CGGCGCTGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-22 5′-CCGGATATGGCGGCGCTGGCGCTGGGCGTGGCGG 414 CGGCGCCGGCGGCGGCGCCGGCGGTGAGCAAGGGCGA GGAGCTG-3′ JO-23 5′-CCGCATATGCTGGCGGTGCTGGTGGTGCTGGTGC 415 TGCTGCCGGTGAGCAAGGGGGAGGAGCTG-3′ JO-24 5′-CCGCATATGGTGGTGGCGGTGCTGGCGCCGGTGC 416 TGGTGAGCAAGGGCGAGGAGCTG-3′ JO-25 5′-CCGCATATGGCGGCGCTGCTGCTGCCGCTGGTGC 417 TGCTGCTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-26 5′-CCGCATATGCCGGCGGCGGTGGCGGGGCTGGTGG 418 TGATTGTGAGCAAGGGCGAGGAGCTG-3′ JO-27 5′-CCGGATATGCTGCTGATTGGGGCGCTGCTGCCGG 419 TGAGCAAGGGCGAGGAGCTG-3′ JO-28 5′-CCGCATATGGCGGCGGTGGTGCTGCTGCCGCTGG 420 CGGCGGcGcCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-29 5′-GCGCATATGGGGGCGGCGGCGGCGGCGCTGCTGG 421 TGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-30 5′-GGGCATATGCTGCCGGTGGTGGCGCTGCTGGCGG 422 TGAGCAAGGGCGAGGAGCTG-3′ JO-31 5′-GCGCATATGGCGGCGGCGCTGGCGGCGCCGCTGG 423 CGCTGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-32 5′-CCGCATATGCTGCTGCTGGCGCTGCTGCTGGCGG 424 CGGTGAGCAAGGGCGAGGAGGTG-3′ JO-33 5′-CCGCATATGGCGGTGGCGGTGGTGGGGCTGCTGG 425 TGAGCAAGGGCGAGGAGCTG-3′ JO-34 5′-CCGCATATGCTGCTGCTGATTATTGTGCTGCTGA 426 TTGTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-35 5′-CCGCATATGGTGGCGCTGGCGGCGGCGGTGGTGC 427 CGGTGAGCAAGGGCGAGGAGCTG-3 JO-36 S′CCGCATATGCCGGCGGCGCTGGCGCTGCTGCTGGT 428 GGCGGTGAGCAAGGGCGAGGAGCTG-3′ J0-37 5′-CCGCATATGATTGTGGCGCTGCTGCTGGTGCCGC 429 TGGTGCTGGCGATTGCGGCGGTGCTGGTGAGCAAGGG CGAGGAGCTG-3′ JO-38 5′-CCGCATATGATTGTGGCGCTGCTGCTGGTGCCGG 430 TGAGCAAGGGCGAGGAGCTG-3′ JO-39 5′-CCGCATATGGCGCTGGTGCTGGCGATTGCGGGGG 431 TGCTGGTGAGCAAGGGGGAGGAGCTG-3′ JO-40 5′-CCGCATATGGCGCTGGTGCTGGCGGCGCTGGTGG 432 GGGTGAGCAAGGGCGAGGAGCTG-3′ JO-41 5′-CCGCATATGGCGGCGGCGCTGCTGGCGGTGGCGG 433 TGAGCAAGGGCGAGGAGCTG-3′ JO-42 5′-CCGCATATGCCGCTGCTGCTGCTGGCGCTGGCGG 434 TGAGCAGGGCGAGGAGCTG-3′ JO-43 5′-CCGCATATGGCGCTGGCGCTGGTGGTGGCGGTGA 435 GCAAGGGCGAGGAGCTG-3′ JO-44 5′-CCGCATATGGTGGCGGCGGTGGTGGTGGGGGCGG 436 TGAGCAAGGGCGAGGAGCTG-3′ JO-45 5′-CCGCATATGCCGCTGGTGGCGCTGCTGCTGCTGG 437 TGGTGAGCAAGGGCGAGGAGCTG-3′ JO-46 5′-CCGCATATGGTGGTGCTGGTGGTGGTGCTGCCGC 438 TGGCGGTGCTGGCGGTGAGCAAGGGGGAGGAGGT G-3′ JO-47 5′-CCGCATATGGCGGCGGCGGTGCCGGTGCTGGTGG 439 CGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-48 5′-CCGCATATGCCGGCGCTGCTGCTGCTGCTGCTGG 440 CGGCGGTGGTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-49 5′-CCGGATATGGGGCTGGCGATTCTGCTGCTGCTGC 441 TGATTGCGCCGGTGAGGAAGGGCGAGGAGCTG-3′ JO-50 5′-CCGCATATGCCGCTGCTGGGGCTGGTGCTGCTGC 442 TGGCGCTGATTGCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-51 5′-CCGCATATGGTGGTGGCGGTGCTGGCGCTGGTGC 443 TGGCGGCGCTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-52 5′-CCGCATATGCCGCTGCTGCTGCTGGTGCGGGCGC 444 TGGTGAGCAAGGGCGAGGAGCTG-3′ JO-53 5′-CCGCATATGCTGGCGGCGGTGGCGGCGCTGGGGG 445 TGGTGGTGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-54 5′-CCGCATATGCTGCTGCTGCTGGTGCTGATTCTGC 446 CGCTGGCGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-55 5′-CCGCATATGCTGGCGGTGGTGGTGGTGGCGGCGG 447 TGGTGAGCAAGGGCGAGGAGCTG-3′ JO-56 5′CCGCATATGGTGCTGCTGGCGGCGGGGCTGATTGC 448 GGTGAGCAAGGGCGAGGAGCTG-3′ JO-57 5′-CCGCATATGCTGATTGCGCTGCTGGCGGCGCCGC 449 TGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-58 5′-CCGCATATGCTGGCGCTGCTGGTGCTGGGGGCGG 450 TGAGCAAGGGGGAGGAGCTG-3′ JO-59 5′-CCGCATATGCTGCTGGCGGCGGCGCTGCTGCTGC 451 TGCTGCTGGCGGTGAGCAAGGGCGAGGAGGTG-3′ JO-60 5′-CCGCATATGGTGATTATTGCGCTGATTGTGATTG 452 TGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-61 5′-CCGCATATGGTGGTGCTGGTGGTGGCGGCGGTGC 453 TGGCGCTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-62 5′-CCGCATATGGTGGCGGTGGCGATTGCGGTGGTGC 454 TGGTGAGCAAGGGCGAGGAGGTG-3′ JO-63 5′-CCGCATATGCCGCTGATTGTGGTGGTGGCGGCGG 455 CGGTGGTGGCGGTGGTGAGGAAGGGCGAGGAGCT G-3′ JO-64 5′-CCGCATATGCCGCTGGGGGTGGCGGTGGCGGCGG 456 TGGCGGCGGTGAGCAAGGGCGAGGAGCTG -3 JO-65 5′-CCGCATATGGCGGCGATTGCGCTGGTGGCGGTGG 457 TGCTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-66 5′-CCGCATATGGCGGCGGCGCTGGCGGCGATTGCGG 458 TGATTGTGAGCAAGGGCGAGGAGCTG-3′ JO-67 5′-CCGCATATGGCGGCGGGGCCGGCGGTGGCGGCGG 459 TGAGCAAGGGCGAGGAGCTG-3′ JO-68 5′-CCGCATATGGTGCTGCTGGCGGCGCTGCCGGTGA 460 GCAAGGGCGAGGAGCTG-3′ JO-69 5′-CGGCATATGGCGCTGCTGGCGGTGGTGGCGGCGG 461 TGAGCAAGGGCGAGGAGGTG-3′ JO-70 5′-GCGGATATGGCGGTGGTGGTGGTGCTGCCGATTC 462 TGGTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-71 5′-CCGCATATGGCGCTGGCGCTGCTGCTGCTGGTGC 463 CGGTGAGCAAGGGCGAGGAGCTG-3′ JO-72 5′-GCGCATATGCTGGTGGTGCTGCTGGCGGCGCTGC 464 TGGTGGTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-73 5′-CCGCATATGCCGGTGCTGCTGCTGCTGGCGCCGG 465 TGAGGAAGGGCGAGGAGCTG-3′ JO-74 5′-GCGCATATGGCGCTGGCGGTGGTGGCGGCGCCGG 466 TGAGCAAGGGCGAGGAGCTG-3′ JO-75 5′-CCGCATATGGTGATTGTGGCGCTGCTGGCGGTGG 467 TGAGGAAGGGGGGGAGCTG-3′ JO-76 5′-CCGCATATGGGGGTGGTGCTGGCGGTGGCGCCGG 468 TGAGCAAGGGCGAGGAGCTG-3′ JO-77 5′-CCGCATATGGCGGTGGCGCTGCTGATTCTGGCGG 469 TGGTGAGGAAGGGCGAGGAGCTG-3′ JO-78 5′-CCGCATATGGTGCTGCTGGCGGTGATTCCGGTGA 470 GCAAGGGCGAGGAGCTG-3′ JO-79 5′-CCGCATATGCTGATTGTGGCGGCGGTGGTGGTGG 471 TGGCGGTGCTGATTGTGAGCAAGGGCGAGGAGCT G-3′ JO-80 5′-CCGCATATGGCGGTGGTGGTGGGGGCGCCGGTGA 472 GCAAGGGCGAGGAGCTG-3′ JO-81 5′-CCGCATATGCTGGCGGCGGTGCTGGTGCTGATTC 473 GGGTGAGGAAGGGCGAGGAGCTG-3 JO-82 5′-CCGCATATGCTGCTGCTGCTGCTGCTGGCGGTGG 474 TGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-83 5′-CCGCATATGGCGGTGGCGCTGGTGGGGGTGGTGG 475 CGGTGGCGGTGAGCAAGGGCGAGGAGCTG -3′ JO-84 5′-CCGCATATGCTGGTGGCGGCGCTGCTGGCGGTGC 476 TGGTGAGCAAGGGCGAGGAGCTG-3′ JO-85 5′-CCGCATATGCTGCTGGCGGCGGCGGCGGCGCTGC 477 TGGTGGCGGTGAGCAAGGGCGAGGAGCTG -3′ JO-86 5′-CCGCATATGCTGGCGGTGCTGGCGGCGGCGCCGG 478 TGAGCAAGGGCGAGGAGCTG-3′ JO-87 5′-CCGCATATGGTGGTGGTGCTGCTGGTGCTGCTGG 479 CGCTGGTGGTGGTGGTGAGCAAGGGCGAGGAGCT G-3′ JO-88 5′-CCGCATATGGTGGTGATTGCGGTGGTGCCGGTGA 480 GCAAGGGCGAGGAGCTG-3′ JO-89 5′-GCGCATATGGTGCTGCTGGTGCTGCTGGCGCTGG 481 TGGTGAGCAAGGGCGAGGAGCTG-3′ JO-90 5′-CCGCATATGGTGCTGCTGGTGCTGCTGGGGCTGG 482 TGGTGAGCAAGGGCGAGGAGGTG-3′ JO-91 5′-CCGCATATGCCGGTGCTGGTGCCGGCGGTGCCGG 483 TGAGCAAGGGCGAGGAGCTG-3′ JO-92 5′-CCGCATATGGCGGCGCTGGCGCTGGCGCTGGCGG 484 TGAGCAAGGGCGAGGAGCTG-3′ JO-93 5′-CCGCATATGGCGGGGGCGGCGCCGGCGCTGGCGG 485 TGAGCAAGGGCGAGGAGCTG-3′ JO-94 5′-CCGCATATGATTGTGCTGCCGGTGCTGGCGGCGC 486 CGGTGAGCAAGGGCGAGGAGCTG-3′ JO-95 5′-CCGCATATGCTGGTGCTGCTGCTGCTGCCGCTGC 487 TGATTGTGAGCAAGGGCGAGGAGGTG-3′ JO-96 5′-CCGCATATGCTGGCGGCGGTGGCGCCGGCGCTGG 488 CGGTGGTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-97 5′-CCGCATATGATTCTGGTGCTGGTGCTGCCGATTG 489 TGAGCAAGGGCGAGGAGCTG-3′ JO-98 5′-CCGCATATGATTCTGCTGCCGCTGCTGCTGCTGC 490 CGGTGAGCAAGGGCGAGGAGCTG-3′ JO-99 5′-CCGGATATGATTGCGCCGGCGGTGGTGGCGGGGC 491 TGCCGGTGAGCAAGGGGGAGGAGCTG-3′ JO-100 5′-CCGCATATGCTGCTGCTGGTGGCGGTGGTGCCGC 492 TGCTGGTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-101 5′-CCGCATATGCTGATTCTGCTGCTGCTGCCGATTA 493 TTGTGAGGAAGGGCGAGGAGCTG-3′ JO-102 5′-CCGCATATGGCGGTGCTGGCGGCGCCGGCGGTGC 494 TGGTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-103 5′-CCGCATATGCTGGCGCTGCCGGTGCTGCTGCTGG 495 CGGTGAGCAAGGGCGAGGAGCTG-3′ JO-104 5′-CCGCATATGGTGGCGCTGGCGCTGCTGCTGGTGA 496 GCAAGGGCGAGGAGGTG-3′ JO-105 5′-CCGGATATGGTGGCGGTGCCGCTGCTGGTGGTGG 497 GGGTGAGGAAGGGCGAGGAGCTG-3′ JO-106 5′-CGGCATATGGCGGTGGCGGTGGCGCGGGTGGCGG 498 CGGCGGCGGTGAGGAAGGGGGAGGAGCTG-3′ JO-107 5′-CCCATATGGCGGCGGCGGTGGTGGCGGCGGTGCC 499 GGCGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-108 5′-CCCATATGGCGCTGCTGGCGGCGCTGCTGGCGCC 500 GGTGAGcAAGGGCGAGGAGCTG-3′ JO-109 5′-CCCATTGCGCTGGCGCTGCTGGTGCCGGTGAGCA 501 AGGGCGAGGAGCTG-3′ JO-110 5′-CCGCATATGGCGCTGCTGGCGGCGCTGCTGGCGC 502 TGCTGGCGCTGCTGGTGGTGAGCAAGGGCGAGGAGCT G-3′ JO-111 5′-CCGCATATGGCGGCGGCGCTGCCGCTGCTGGTGC 503 TGCTGCCGGTGAGCAAGGGGGAGGAGCTG-3′ JO-112 5′-CCCATATGGCGGCGGCGGTGGCGGCGGCGCTGGC 504 GCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-113 5′-CCCATTGGCGGCGCTGGCGGTGGCGGCGCTGGCG 505 GCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-114 5′-CCCATTGGCGGTGCTGGCGGCGGCGGTGCCGGTG 506 AGCAAGGGCGAGGAGCTG-3′ JO-115 5′-CCGCATATGGTGGCGGCGCTGCCGGCGCCGGGGG 507 TGAGCAAGGGCGAGGAGCTG-3′ JO-116 5′-CCGCATATGGCGCTGGCGCTGGCGGTGCCGGCGG 508 TGCTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-117 5′-CCGCATATGGCGGCGCTGCTGCCGGCGGCGGTGG 509 CGGTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-118 5′-CCGCATATGGCGGTGGTGGTGGCGCTGGCGCCGG 510 TGAGCAAGGGCGAGGAGCTG-3′ JO-119 5′-CCGCATATGGCGGCGGCGGTGGCGGTGCCGGCGG 511 CGGCGGCGCTGCTGGCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-120 5′-CCGCATATGGCGGTGGTGCTGCCGCTGGCGCTGG 512 TGGCGGTGGCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-121 5′-CCGCATATGCTGGTGGCGCTGCCGGTGCTGCCGG 513 TGAGGAAGGGCGAGGAGCTG-3′ JO-122 5′-CCGCATATGGTGGTGGTGCCGCTGCTGCTGATTG 514 TGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-123 5′-CCGCATATGCTGGCGGTGGTGCTGGCGGTGGCGG 515 TGAGCAAGGGCGAGGAGCTG-3′ JO-124 5′-CCGCATATGCTGCTGGCGGTGCCGATTCTGCTGG 516 GGGCTGCCGGTGAGCAAGAGGAGCTG-3′ JO-125 5′-CCGGATATGCTGGTGGGGCTGGTGCTGGTGCCGG 517 TGAGCAAGGGCGGGAGCTG-3′ Jo-126 5′-CCGCATATGCTGGTGCTGCTGCTGGCGGTGCTGC 518 TGCTGGCGGTGCTGCCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-127 5′-CCGCATATGCTGCTGGCGCCGGTGGTGGCGCTGG 519 TGATTCTGCCGGTGAGGAAGGGCGAGGAGCTG-3′ JO-128 5′-CCGCATATGGTGCTGGCGGTGCTGGCGGTGCCGG 520 TGCTGCTGCTGCCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-129 5′-CCGCATATGGTGGTGATTGCGGTGGTGCCGGTGG 521 TGGTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-130 5′-CCGCATATGCTGCTGGTGCTGCTGGCGCTGGTGG 522 TGGTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-131 5′-CCGCATATGGTGCTGCTGGCGCTGCCGGTGGTGG 523 CGGCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-132 5′-CCGCATATGGCGGTGGTGGTGCCGGCGATTGTGC 524 TGGCGGCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-133 5′-CCGCATATGGCGGTGCTGGTGCCGGCGGCGGCGC 525 TGGTGCCGGTGAGGAAGGGCGAGGAGCTG-3′ JO-134 5′-CCGCATATGGTGGTGGCGGCGCTGCCGCTGGTGC 526 TGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-135 5′-CCGCATATGGCGGCGGTGGCGCTGCCGGCGGCGG 527 CGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-136 5′-CCGCATATGGTGATTGCGCTGCCGCTGCTGCCGG 528 TGAGCAAGGGCGAGGAGCTG-3′ JO-137 5′-CCGCATATGCTGCTGGCGCTGCCGCTGGTGCTGG 529 TGCTGGCGCTGCCGGTGAGCAAGGGCGAGGAGGT G-3′ JO-138 5′-CCGGATATGATTGTGCCGCTGCTGCTGGCGGCGC 530 CGGTGAGCAAGGGCGAGGAGCTG-3′ JO-139 5′-CCGCATATGCTGCTGCTGGCGCCGCTGCTGCTGG 531 CGCCGGTGAGCAGGGCGAGGAGCTG-3′ JO-140 5′-CCGCATATGCTGGGGGCGCTGCCGGTGGCGGGGG 532 TGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-141 5′-CCGCATATGGCGCTGGCGGTGATTGTGCTGGTGC 533 TGCTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-142 5′-CGGCATATGCTGGCGCTGCTGCTGCCGGCGGCGC 534 TGATTCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-143 5′-CCGCATATGGCGCTGCTGCCGCTGCTGGCGGTGG 535 TGCTGCCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-144 5′-CCGCATATGGCGATTGCGGTGCCGGTGCTGGCGG 536 CGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-245 5′-CCGCATATGGCGGCGGCGCGGGTGCTGCTGCTGC 537 TGCTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-146 5′-CCGCATATGGCGGCGGCGCCGGTGCTGCTGCTGC 538 TGCTGCGGGTGAGCAAGGGCGAGGAGCTG-3′ JO-147 5′-CCGCATATGGCGGCGCTGGCGGCGCTGGTGGTGG 539 CGGCGCCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-148 5′-CCGCATATGGCGGCGCTGGCGGCGGTGCCGCTGG 540 GGCTGGCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-149 5′-CGGCATATGGCGCTGGCGGTGGCGGCGCCGGCGC 541 TGGCGCTGCTGCCGCCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-150 5′-CCGCATATGCTGGTGGCGCTGGTGCTGCTGCCGG 542 TGAGCAAGGGGGAGGAGCTG-3′ Jo-151 5′-GCGCATATGCTGGTGCTGCTGCTGGCGGTGCTG 543 CTGCTGGGGGTGCTGCCGGTGAGCAAGGGCGAGGAG CTG-3′ JO-152 5′-CCGCATATGCTGCTGGCGCCGGTGGTGGCGCTG 544 GTGATTCTGCCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-153 5′-CCGCATATGGTGCTGGCGGTGCTGGCGGTGGCG 545 GTGCTGCTGCTGCCGGTGAGCAAGGGCGAGGAGCT G-3′ JO-154 5′-CCGCATATGGTGGTGATTGCGGTGGTGCCGGTG 546 GTGGTGGTGAGCAGGGCGAGGAGCTG-3′ JO-155 5′-CCGGATATGCTGCTGGTGCTGCTGGCGCTGGTG 547 GTGGTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-156 5′-CCGCATATGGTGCTGCTGGCGCTGCCGGTGGTG 548 GCGGCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-157 5′-CCGCATATGGCGGTGGTGGTGCCGGCGATTGTG 549 CTGGCGGCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-158 5′-CCGCATATGGCGGTGCTGGTGCCGGCGGCGGCG 550 CTGGTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-159 5′-CCGCATATGGTGGTGGCGGCGCTGCCGCTGGTG 551 CTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-160 5′-CCGCATATGGCGGCGGTGGCGCTGCCGGCGGCG 552 GCcICCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-161 5′-CCGCATATGCTGATTGCGCTGCCGCTGCTGCCG 553 AGTGAGCAAGGGCGGGAGCTG-3′ JO-162 5′-CCGCATATGCTGCTGGCGCTGCCGCTGGTGGTG 554 CGGTGCTGGCGCTGCGTGAGCAAGGGCGAGGAGCT G-3′ JO-163 5′-CCGCATATGATTGTGCCGCTGCTGCTGGCGGCG 555 GCCGGTGAGCAAGGCGAGGAGCTG-3′ JO-164 5′-CCGCATATGCTGCTGCTGGCGCCGCTGGTGCTG 556 AGGGCCGGTGAGCAGGGCGAGGAGCTG-3′ JO-165 5′-CGGCATATGCTGGCGGCGCTGCCGGTGGCGGCG 557 AGTGCCGGTGAGCAGGGCGAGGAGCTG-3′ JO-166 5′-CCGCATATGGCGCTGGCGGTGATTGTGCTGGTG 558 AGTGCTGGTGAGCAGGGCGAGGAGCTG-3′ JO-167 5′-CCGCATATGCTGGGGCTGCTGCTGCCGGCGGCG 559 CTGATTCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-168 5′-GCGGATATGCTGGCGGCGGTGGTGCCGGTGGCG 560 GCGGCGGTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-169 5′-CCGCATATGGTGGCGGCGCCGGGGGCGGCGGCG 561 CCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-170 5′-GCGGATATGGCGGTGCCGGTGCCGGTGCCGCTG 562 GTGAGCAAGGGCGAGGAGCTG-3′ JO-171 5′-CCGCATATGCTGCTGATTCTGCCGATTGTGCTG 563 CTGCGGGTGAGCAAGGGCGAGGAGCTG-3′ JO-172 5′-CCGCATATGGCGCTGGCGCTGCCGGCGCTGGCG 564 ATTGCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-173 5′-CCGCATATGGCGGTGATTCCGATTCTGGCGGTG 565 CCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-174 5′-CCGCATATGCTGATTCTGCTGCTGCCGGCGGTG 566 GCGCGCTGCCGGTGAAAGGGCGAGGAGCTG-3′ JO-175 5′-CCGCATATGATTGTGCTGGCGCCGGTGCCGGCG 567 GCGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-176 5′-CCGCATATGGTGGTGGTGGTGCCGGTGCTGGCG 568 GCGGCGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-177 5′-GCGCATATGGTGGTGGCGGTGGCGGCGCCGGTG 569 AGCAAGGGCGAGGAGCTG-3′ JO-178 5′-CCGCATATGCTGGTGCTGGCGGCGCCGGCGGCG 570 CTGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-179 5′-CCGCATATGCTGATTGCGCCGGCGGCGGCGGTG 571 CCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-180 5′-CGGCATATGGCGCTGGCGGCGCTGCCGATTGGG 572 CTGCGGGTGAGCAAGGGCGAGGAGGTG-3′ JO-181 5′-CCGCATATGGCGGTGCTGCTGCTGCCGGGGGCG 573 GCGGTGAGCAAGGGGGAGGAGCTG-3′ Jo-182 5′-CCGCATATGATTGCGCTGGCGCTGCTGCCGCTGC 574 TGGTGAGCAAGGGCGAGGAGCTG-3′ JO-183 5′-CCGCATATGGTGCTGCTGGCGGCGGCGGTGATT 575 GCGCCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-184 5′-CCGCATATGGCGCCGGCGGTGCTGCCGCCGGTG 576 GTGGTGATTGTGAGCAAGGGCGAGGAGCTG-3′ JO-185 5′-CCGCATATGGTGGTGGGCCTGCTGGTGGCGGCG 577 CTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-186 5′-CCGCATATGGCGGCGATTGCGGCGGCGGCGCCG 578 CTGGCGGCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-187 5′-CCGCATATGCTGCTGCTGGCGGTGGCGCGGGTG 579 AGCAAGGGCGAGGAGCTG-3′ JO-188 5′-CCGCATATGCTGATTCTGCTGCTGCCGCTGGCG 580 GCGCTGGTGAGCAAGGGCGAGGAGCTG-3′ JO-189 5′-CCGCATATGGCGCTGCTGCTGCTGGTGCTGGCG 581 GTGAGCAAGGGCGAGGAGCTG-3′ JO-190 5′-CGGCATATGGTGGTGCTGCTGCTGCTGCCGCTG 582 GCGGTGAGCAAGGGCGAGGAGCTG-3′ JO-191 5′-GCGCATATGCTGGCGCTGCCGCTGCTGCTGCCG 583 GTGAGCAAGGGCGAGGAGCTG-3′ JO-192 5′-CCGCATATGCTGCTGGTGCTGCCGCTGCTGATT 584 GTGAGCAAGGGGGAGGAGCTG-3′ JO-193 5′-CCGCATATGCTGCGGCTGCTGCCGGCGGCGCTG 585 GTGGTGAGCAAGGGCGAGGAGCTG-3′

TABLE 2 PCR reverse primer sequence for each MTD-EGFP protein SEQ ID MTD Sequence NO JO-1~ 3′-TTA TCT AGA TCC GGT GGA TCC CGG 586 JO-193 GCC-5′

Referring to FIG. 4 a, each of the PCR amplified products was subcloned into the pGEM-Teasy vector (Promega, Madison Wis., USA) with a T4 DNA ligase, according to the TA cloning method and then E. coli DH5α was transformed, where transformants were selected on LB plate media containing 50 μg/mL ampicillin, IPTG and X-gal. After the recombinant fragment inserted into pGEM-Teasy vector was isolated by treating with the restriction enzyme NdeI, it was subjected to 0.8% agarose gel electrophoresis. As a result, DNA fragments of about 1 kb and vector fragments of about 3 kb were detected, which confirms that the insert DNA of MTD-EGFP was appropriately subcloned into pGEM-Teasy vector (FIGS. 4 b to 4 d).

As described in FIG. 5 a, each of the isolated insert DNA fragments encoding MTDs-EGFP was cloned into the E. coli expression plasmid pET-28a(+) (commercially available from Novagen, Madison, Wis.). The pET-28a(+) plasmids are designed to facilitate His-tag fusions at either the N- or C-terminus and to provide strong expression of the genes in E. coli from the T7 phage promoter. At the 3′ end of each MTD-EGFP encoding gene, the coding sequence was fused in frame at the NdeI site to the His-tag sequence followed by a translation stop codon, which results in the production of MTD-EGFP recombinant proteins with six histidine residues added to the C-terminus for the sake of easy purification on nickel columns.

After the clones were treated with the restriction enzyme NdeI and subjected to agarose gel electrophoresis, it was verified that DNA fragments of about 1 kb and vector fragments of about 5 kb were detected, which confirms the cloning of the insert DNA of MTD-EGFP into pET-28a(+) vector, as shown in FIGS. 5 b to 5 c. Further, out of the 193 novel MTDs, 148 MTDs were found to be successfully cloned for the expression of His-MTD-EGFP recombinant proteins.

Example 3 Inducible Expression and Purification of Recombinant Proteins Fused to MTDs

To express the cell permeable recombinant proteins fused to MTDs prepared as described in Example 2 above, the expression vectors comprising the His-MTD-EGFP recombinant proteins were transfected in the BL21 (DE3), BL21-Gold (DE3), BL21-CodonPlus (DE3) and BL21-GoldpLysS (DE3) strains, respectively. An EGFP expression vector including kFGF4-derived MTD (SEQ ID NO: 387) was used as a positive control, while an EGFP expression vector fused to a scramble peptide (SEQ ID NO: 389) having no function was used as a negative control.

After the transfection, cells were grown at 37° C. in an LB medium containing kanamycin (30 μg/ml) with vigorous shaking until the optical density 600 (OD₆₀₀) reached between 0.4 and 0.6. IPTG (isoprophyl-β-D-thiogalactoside) was then added thereto at a final concentration of 0.6 mM to induce the expression of the His-MTD-EGFP recombinant proteins. Protein induction was prolonged for 3 hours at 37° C. His-MTD-EGFP recombinant proteins expressed in E. coli, with IPTG were loaded on a SDS-PAGE gel, stained with Coomassie Brilliant Blue, and then destained. As illustrated in FIGS. 6 a and 6 b, except for some His-MTD-EGFP recombinant proteins expressed in BL21-GoldpLysS (DE3) strains, most His-MTD-EGFP recombinant proteins were expressed at a high level in BL21-CodonPlus (DE3). Several His-MTD-EGFP recombinant proteins were not expressed.

The inducible expression of His-MTD-EGFP recombinant proteins in an E. coli system leads to the formation of insoluble aggregates, which are known as inclusion bodies. To completely solubilize these inclusion bodies, all of the above expressed proteins were denatured by dissolving them in 8 M urea. Denatured His-MTD-EGFP recombinant proteins were purified by histidine tag affinity chromatography, using a nickel nitrilotriacetate resin (Qiagen, Hilden, Germany). Since strong denaturants, such as 8 M urea, completely solubilize the inclusion bodies, the purification method was carried out under pH-dependent denaturing conditions.

The E. coli culture solutions were harvested by centrifugation at 4,000×g for 20 minutes, re-suspended in a lysis buffer (100 mM NaH₂PO₄, 10 mM Tris.Cl, 8 M urea, pH 8.0), and subjected to ultrasonication on ice using a sonicator equipped with a probe. The cell lysates were centrifuged at 7,000×g for 20 minutes, so as to separate the supernatant and the cellular debris pellet. The supernatant was taken out and then incubated with a Ni-NTA resin equilibrated with the lysis buffer by gently shaking (using a rotary shaker) for 2 hours to overnight. After washing with a washing buffer (100 mM NaH₂PO₄, 10 mM Tris.Cl, 8 M urea, pH 6.3) five times, the proteins bound to the resin were eluted with an elution buffer (100 mM NaH₂PO₄, 10 mM Tris.Cl, 8 M urea, pH 4.5). The His-MTD-EGFP recombinant proteins purified under the denaturing conditions described above were analyzed on a SDS-PAGE gel and stained with Coomassie Brilliant Blue, where the results thereof are shown in FIGS. 7 a and 7 b.

In order to renature the His-MTD-EGFP recombinant proteins purified above, the denatured proteins were refolded by removing the denaturant. Urea was removed from the proteins by dialyzing them against a refolding buffer (0.55 M Guanidine HCl, 0.44 M L-Arginine, 50 mM Tris-HCl, 150 mM NaCl, 1 mM EDTA, 100 mM NDSB, 2 mM Glutathione Oxidized, and 0.2 mM Glutathione Reduced). All of the refolded recombinant proteins were dialyzed for 9 hours against a physiological buffer, such as a cell culture medium (e.g., Dulbecco's modified Eagle's Medium: DMEM). After the replacement of the refolding buffer with DMEM, the cell permeabilities of all of the purified recombinant proteins were ready to be determined in vitro and in vivo. According to the SDS-PAGE analysis results shown in FIG. 8, out of the 148 transformants established as described in Example 2 above, the inducible expression of His-MTD-EGFP recombinant proteins was visually detected in 112 transformants, whereas 4 His-MTD-EGFP recombinant proteins were not expressed enough to be visually detected but were sufficiently purified for cell permeability analysis. The 116 His-MTD-EGFP recombinant proteins were prepared in a soluble form.

Example 4 Determination of Quantitative Cell Permeability of Recombinant Proteins Fused to MTDs

In order to quantitatively determine the cell permeability of the His-MTD-EGFP recombinant proteins, the 116 recombinant proteins purified in a soluble form as described in Example 3 above were mixed with 0.7 μg/μl of fluorescein isothiocyanate (FITC) and reacted at room temperature for 1 hours by stirring. The reaction solution was subjected to a dialysis against Dulbecco's modified Eagle's medium (DMEM; WelGENE Inc., Korea) for 2 days until the FITC was completely removed to thereby obtain FITC-conjugated recombinant proteins. RAW 264.7 cells derived from mouse macrophage were treated with 10 μM of the FITC-labeled protein. The RAW 264.7 cells were maintained in DMEM supplemented with 10% fetal bovine serum and 1% penicillin (500 mg/ml, WelGENE Inc.), and incubated at 37° C. in a humidified atmosphere of 5% CO₂ in air. After the incubation, the cells were incubated with 10 μM of each of the FITC-conjugated recombinant protein prepared above for, 1 hour at 37° C., followed by treating them with Trypsin/EDTA (T/E, Invitrogen, Carlsbad Calif., USA) to remove cell surface bound proteins and washing with cold PBS three times.

The FITC-conjugated His-MTD-EGFP recombinant proteins were subjected to fluorescence-activated cell sorting (FACS) analysis (FACS Calibur, Beckton-Dickinson, San Diego Calif., USA). For each sample, the cells (1×10⁴) were analyzed by using the CeliQuest Pro cytometric analysis software. Each experiment was conducted at least twice. The cell permeable potency of each His-MTD-EGFP recombinant protein fused to novel MTDs (JO-1 to JO-193) was visually compared to that of a positive control protein fused to kFGF4-derived MTD.

FIGS. 9 a to 9 g show the results of a flow cytometry analysis where the gray filled curve represents cell only, the black curve represents FITC only, the blue curve represents the cell permeability of a negative control (scramble peptide), the red curve represents the cell permeability of a positive control (kFGF4-derived MTD), and the green curve represents the cell permeability of each recombinant protein.

In order to evaluate the cell permeability of each MTD in mammalian cells, the cellular uptake efficiencies of the His-MTD-EGFP recombinant proteins were compared with those of the positive control protein fused to kFGF4-derived MTD and the negative control protein fused to a scramble peptide, by evaluating the change in median fluorescence after incubation with each FITC-conjugated His-MTD-EGFP recombinant protein.

Referring to the results shown in FIGS. 9 a to 9 g, none of the cells treated with 10 μM of the FITC-conjugated His-MTD-EGFP proteins for 1 hour at 37° C. underwent a change in median fluorescence such that the median fluorescence after treatment was significantly lower than that of the cells treated with the negative control. In contrast, 80 His-MTD-EGFP recombinant proteins, out of the 111 proteins that were tested, exhibited 0.5-fold (50%) or higher fluorescence signal than that of the positive control. These results suggest that some of the newly developed novel MTDs fused with the cargo molecule EGFP exhibit significantly high levels of plasma membrane-penetrating ability which is enough to deliver macromolecules, such as protein, into live cells.

Example 5 Determination of Cell Permeability and Intracellular Localization of Recombinant Proteins Fused to MTDs

Out of the 111 FITC-conjugated His-MTD-EGFP recombinant proteins tested by flow cytometry, 60 His-MTD-EGFP recombinant proteins were selected for visualization of cell permeability and intracellular localization thereof. Among them, 50 His-MTD-EGFP recombinant proteins exhibited higher cell permeability than 0.5-fold as compared with the positive control protein fused to kFGF4-derived MTD.

NIH 3T3 cells were treated without (cell only) or with FITC (FITC only), or FITC-conjugated recombinant proteins such as a negative control (His-scramble peptide-EGFP), a positive control (His-kFGF4-derived MTD-EGFP) or the recombinant proteins fused to novel MTDs (His-MTDs-EGFP), and visualized by confocal laser scanning microscopy. NIH 3T3 cells were cultured for 24 hours in an 8-well chamber slide (LabTek, Nalgen Nunc, Rochester N.Y., USA). Cells were maintained in DMEM supplemented with 10% fetal bovine serum, 1% penicillin and streptomycin in 5% CO₂ at 37° C. The cells were washed with PBS three times, and then treated for 1 hour with DMEM, DMEM plus free FITC, or DMEM containing 10 μM FITC-conjugated recombinant proteins in 5% CO₂ at 37° C. One hour after the treatment, the cells were fixed in 4% paraformaldehyde (PFA) for 20 minutes at room temperature for observation.

For the direct detection of FITC-conjugated recombinant proteins that were internalized, the cells were washed with PBS three times and counterstained with a nuclear fluorescent stain solution, propidium iodide (PI, Sigma-Aldrich, St Louis Mo., USA), at a concentration of 1 μg/ml. After PI staining for 5 minutes, the cells were washed with PBS three times and fixed by polyvinyl alcohol mountain medium with DABCO (Fluca, St Louis Mo., USA). The intracellular distribution of the fluorescence was determined at the middle of a single cell analyzed by confocal laser scanning microscopy, where the results are shown in FIGS. 10 a to 10 i. Parameters specific for each fluorochrome were followed as FITC: excited at 488 nm light, detected with a 530 nm bandpass filter.

Surprisingly, as shown in FIGS. 10 a to 10 i, 11 a, and 11 b, the FITC-conjugated His-MTD-EGFP recombinant proteins were well distributed largely in either the cytoplasm or the nucleus (JO-13, -18, -49, -58, -68, -101, -108, -116, -118, -122, -123, -127, -132, -133, -136, -138, -140, -148, -162, -169, -170, and -172) or both, as compared with the negative control protein fused to a scramble peptide. Ten MTDs (JO-17, -18, -21, -31, -41, -49, -88, -107, -116, and -169), which exhibited lower cell permeability than 0.5-fold as compared with the positive control (1.0-fold), showed weak intracellular localization ability after an 1 hour treatment. These results completely coincided with those from the flow cytometry analysis.

Example 6 Determination of In Vivo Tissue Distribution of Recombinant Proteins Fused to MTDs

Forty-three recombinant proteins were selected from 111 FITC-conjugated recombinant proteins tested by flow cytometry for the analysis of in vivo tissue distribution ability. FITC only (DMEM plus free FITC), a negative control (FITC-conjugated recombinant protein fused to a scramble peptide), a positive control (FITC-conjugated recombinant protein fused to a kFGF4-derived MTD), or the His-MTD-EGFP recombinant proteins (FITC-conjugated recombinant proteins fused to selected novel MTDs) were intraperitoneally injected into Balb/c mice (6 weeks, female, Central Lab. Animal Inc., Korea) at a dose of 300 μg/500 μl/mouse, respectively. After 2 hours, six organs (i.e., liver, kidney, spleen, lung, heart, and brain) of each mouse were extracted, washed with PBS, and quickly frozen with an O.C.T. compound (Sakura, Japan) on dry ice. Cryosections (20 μm thickness) of each organ were prepared by a microtome cryostat (Sakura), placed on glass slides and, then, mounted in a Vectashield mounting medium (Vecta lab, Burlingame Calif., USA). In vivo tissue distribution of selected novel MTDs was analyzed by using a fluorescence microscope (Nikon, Japan), where the results are shown in FIGS. 12 a to 12 i.

In the negative control, there was no significant level of fluorescence. In contrast, the positive control protein fused to kFGF4-derived MTD showed well distributed fluorescence activity in all of the organs. With respect to the mice injected with the recombinant proteins fused to selected novel MTDs, most MTDs having 0.5-fold or higher cell permeability than the positive control as determined by flow cytometry showed relatively strong in vivo tissue distribution ability in all of the organs, although there were some minor differences among the organs. In particular, JO-13 (1.3-fold) and JO-133 (1.5-fold) MTDs were distributed well in all of the organs except the brain. These results indicate that the novel MTDs demonstrated their strong cell permeability and/or intracellular localization ability by flow cytometry and confocal laser scanning microscopy and can effectively mediate macromolecule intracellular transduction in vitro and in vivo.

In summary, the macromolecule transduction domain (MTD) peptides identified according to the method of the present invention have the characteristics of inducible protein expression, purification and cell permeability as represented in FIGS. 13 a to 13 k.

The present invention has been described in detail with reference to specific embodiments thereof. However, it will be appreciated by those skilled in the art that changes may be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents. 

1. An isolated macromolecule transduction domain (MTD) peptide capable of mediating the transport of a biologically active molecule into a cell, wherein the peptide has an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193.
 2. An isolated polynucleotide encoding the macromolecule transduction domain (MTD) peptide according to claim
 1. 3. The isolated polynucleotide according to claim 2, wherein the polynucleotide has a nucleotide sequence selected from the group consisting of SEQ ID NOS: 194-386.
 4. A method of identifying a macromolecule transduction domain (MTD) peptide capable of mediating the transport of a biologically active molecule into a cell, comprising: identifying secreted proteins having a signal sequence-like domain from multiple amino acid sequence databases; selecting from said identified secreted proteins hydrophobic peptides having a single hydrophobic region at their N-terminus and forming a helix structure; and optimizing and minimizing said selected hydrophobic peptides under conditions effective to produce peptides suitable for use as an MTD peptide.
 5. The method according to claim 4, wherein the signal sequence-like domain is selected from the group consisting of penetratin (Arg-Gln-Ile-Lys-Ile-Trp-Phe-Gln-Asn-Arg-Arg-Met-Lys-Trp-Lys-Lys (SEQ ID NO: 592)), the Tat peptide derived from the transactivating protein Tat of HIV-1 (Arg-Lys-Lys-Arg-Arg-Gln-Arg-Arg-Pro (SEQ ID NO: 593)), transportan (Gly-Trp-Thr-Leu-Asn-Ser-Ala-Gly-Tyr-Leu-Leu-Gly-Lys-Ile-Asn-Lys-Ala-Leu-Ala-Ala-Leu-Ala-Lys-Lys-Ile-Leu (SEQ ID NO: 594)), Buforin II (Thr-Arg-Ser-Ser-Arg-Ala-Gly-Leu-Gln-Phe-Arg-Val-Gly-Arg-Val-His-Arg-Leu-Leu-Arg-Lys (SEQ ID NO: 595)), MAP (model amphiphatic peptide: Lys-Leu-Ala-Leu-Lys-Ala-Ser-Leu-Lys-Ala-Leu-Lys-Ala-Ala-Leu-Lys-Leu-Ala (SEQ ID NO: 596)), k-FGF (Ala-Ala-Val-Ala-Leu-Leu-Pro-Ala-Val-Leu-Leu-Ala-Leu-Leu-Ala-Pro (SEQ ID NO: 597)), Ku 70 (Val-Pro-Met-Leu-Lys-Pro-Met-Leu-Lys-Glu (SEQ ID NO: 598)), prion (Met-Ala-Asn-Leu-Gly-Tyr-Trp-Leu-Leu-Ala-Leu-Phe-Val-Thr-Met-Trp-Thr-Asp-Val-Gly-Leu-Cys-Lys-Lys-Arg-Pro-Lys-Pro (SEQ ID NO: 599)), pVEC (Leu-Leu-Ile-Ile-Leu-Arg-Arg-Arg-Ile-Arg-Lys-Gln-Ala-His-Ala-His-Ser-Lys (SEQ ID NO: 600)), pep-1 (Lys-Glu-Thr-Trp-Trp-Glu-Thr-Trp-Trp-Thr-Glu-Trp-Ser-Gln-Pro-Lys-Lys-Lys-Arg-Lys-Val (SEQ ID NO: 601)), SynB1 (Arg-Gly-Gly-Arg-Leu-Ser-Tyr-Ser-Arg-Arg-Arg-Phe-Ser-Thr-Ser-Thr-Gly-Arg (SEQ ID NO: 602)), pep-7 (Ser-Asp-Leu-Trp-Glu-Met-Met-Met-Val-Ser-Leu-Ala-Cys-Gln-Tyr (SEQ ID NO: 603)), and HN-1 (Thr-Ser-Pro-Leu-Leu-Ile-His-Asn-Gly-Gln-Lys-Leu (SEQ ID NO: 604)).
 6. The method according to claim 4, wherein the amino acid sequence database is selected from the group consisting of the PubMed Entrez Protein Database.
 7. The method according to claim 4, wherein said optimizing and minimizing step is carried out by the following steps: eliminating non-hydrophobic amino acids at the left or right side of the selected hydrophobic peptide to minimize the peptide length while maintaining the hydrophobic region thereof; replacing non-hydrophobic amino acids at the middle or right side of the selected hydrophobic peptide with a hydrophobic amino acid, proline, to provide flexibility; and minimizing the number of repeated hydrophobic amino acids to reduce the length of the peptide.
 8. The method according to claim 4, wherein the MTD peptide produced as a result of said optimizing and minimizing exhibits cell permeability, hydrophobicity, and flexibility, has about 7 to about 17 amino acids in length, and forms a helix structure.
 9. The method according to claim 8, wherein the MTD peptide has an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193.
 10. An isolated recombinant protein having cell permeability comprising: a macromolecule transduction domain (MTD) peptide having an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193; and a biologically active molecule.
 11. The isolated recombinant protein according to claim 10, wherein the biologically active molecule is selected from the group consisting of proteins, polypeptides, and peptides.
 12. The isolated recombinant protein according to claim 10, wherein the biologically active molecule is selected from the group consisting of growth factors, enzymes, transcription factors, toxins, antigenic peptides, antibodies, and antibody fragments.
 13. The isolated recombinant protein according to claim 12, wherein the biologically active molecule is selected from the group consisting of enzymes, hormones, transport proteins, immunoglobulin, antibodies, structural proteins, motor function proteins, receptors, signaling proteins, storage proteins, membrane proteins, transmembrane proteins, internal proteins, external proteins, secret proteins, virus proteins, native proteins, glicoproteins, cleaved proteins, proteins with disulfide bonds, protein complexes, chemically modified proteins and prions.
 14. The isolated recombinant protein according to claim 10, wherein the biologically active molecule is selected from the group consisting of nucleic acids, coding nucleic acid sequences, mRNAs, antisense RNA molecules, carbohydrates, lipids, and glycolipids.
 15. The isolated recombinant protein according to claim 10, wherein the biologically active molecule is a therapeutic agent.
 16. The isolated recombinant protein according to claim 15, wherein the biologically active molecule is selected from the group consisting of therapeutic drugs and toxic chemicals.
 17. A method of genetically engineering a biologically active molecule having cell permeability comprising attaching a macromolecule transduction domain (MTD) peptide to a biologically active molecule under conditions effective to produce a biologically active molecule having cell permeability, wherein the MTD peptide has the amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193.
 18. The method according to claim 17, wherein said attaching comprises attaching a MTD peptide to the N-terminal, C-terminal, both or middle of the biologically active molecule.
 19. The method according to claim 17, wherein said attaching comprises attaching a MTD peptide to the biologically active molecule by a peptide bond.
 20. The method according to claim 17, wherein said attaching comprises attaching a MTD peptide to the biologically active molecule by a covalent bond.
 21. A method of transporting a biologically active molecule into a cell in a subject comprising administering to a subject a cell permeable recombinant protein comprising a macromolecule transduction domain (MTD) peptide attached to a biologically active molecule, wherein the MTD peptide has an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193.
 22. Use of a macromolecule transduction domain (MTD) peptide for drug delivery, wherein the MTD peptide has an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193.
 23. Use of a macromolecule transduction domain (MTD) peptide for vaccine administration, wherein the MTD peptide has an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193.
 24. Use of a macromolecule transduction domain (MTD) peptide for protein therapy, wherein the MTD peptide has an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193.
 25. Use of a macromolecule transduction domain (MTD) peptide for gene therapy, wherein the MTD peptide has an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-193. 